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STUDIES OF DISTRIBUTED PRACTICE: II. LEARNING 
AND RETENTION OF PAIRED-ADJECTIVE LISTS 
WITH TWO LEVELS OF INTRA-LIST SIMILARITY 


BY BENTON J. UNDERWOOD 


Northwestern University! 


In the first study in this series it was 
shown that serial lists of highly simi- 
lar adjectives were learned more 
rapidly by distributed than by massed 
practice (11). Lists made up of dis- 
similar items tended to be learned 
more rapidly by distribution, but 
differences were not significant statisti- 
cally. Thus, it was concluded that 
intra-task similarity is a variable 
which must be accounted for in any 
theory of distributed practice of ver- 
bal materials. Furthermore, the re- 
sults demonstrated that the number 
of errors made during learning was 
directly related to facilitation pro- 
duced by distributed practice. More 
errors per trial occurred under the 
distributed conditions than under the 
massed conditions. This fact was 
taken as evidence against the Mc- 
Geoch version of the differential-for- 
getting theory, since that theory pre- 
dicts a decrease in the number of 
errors following a rest period. On 
the positive side the evidence con- 
formed to the general outline of Gib- 
son’s theory of verbal learning since 


1 These experiments were made possible by a 
grant from The Graduate School, Northwestern 
University. 


that theory would predict an increase 
in error frequency following rest inter- 
vals, the increase being attributed to 
the recovery of generalization tenden- 
cies (2). Since Gibson’s theory does 
not have a specific accounting of the 
influence of distributed practice on 
learning, details of the findings could 
not be checked against the theory. 

In the present experiments the in- 
vestigation has been extended to 
paired-associate lists with two degrees 
of similarity, one in which intra-list 
similarity is slight, and one in which 
it is very great. These lists are pre- 
sented under different degrees of dis- 
tribution. In addition, retention 
measurements have been taken after 
three different intervals. 


PROCEDURE 


Materials.—For each level of similarity three 
lists were required. A sample of the lists is pre- 
sented below. In constructing the high-simi- 

Low Similarity 
lovely—resting 
uphill—filmy 
oldish—instant 
drastic—latent 
pious—mature 
rancid—bar 
adrift—equal 
jellied—aware 
nimble—vagrant 
human—waxen 


High Similarity 
clumsy—twofold 
double—soiled 
dirty—frigid 
arctic—devoid 
awkward—unclean 
vacant—dual 
paired—bungling 
frozen—empty 
filthy—icy 
unfilled—-graceless 
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larity lists an attempt was made to maximize the 
similarity relationships among the items; thus, 
five sets of four highly synonymous words, taken 
from Haagen’s lists (4), wereemployed. Similar 
items appeared both as stimuli and as responses; 
therefore, it is presumed that stimulus generaliza- 
tion, response generalization, and stimulus-re- 
sponse generalization occurred. 

The lists of adjectives were presented by a 
Hull-type drum with a 2-sec. exposure of the 
stimulus followed by 2 sec. in which the stimulus 
and response were presented together. The lists 
were presented in three different orders to mini- 
mize serial learning. Anticipation learning was 
used throughout. 

Conditions —The common feature of the six 
experiments was the three degrees of distribu- 
tion. In the most massed condition 4 sec. 
elapsed between trials; in a second condition, 
30 sec. occurred between trials, and in the third 
condition, 2 min. was used as the distribution 
interval. The differences among the experi- 
ments are shown below. Exp. I, II, and III 
employed the low-similarity lists; Exp. IV, V, 
and VI, the high-similarity lists. The retention 
intervals were 10 min., 24 hr., and 48 hr. 
Retention 
Interval 


10 min. 
24 hr. 
48 hr. 


Low 


High Retention 
Similarity 


i 

Similarity Interval 
Exp. IV 
Exp. V 
Exp. VI 


Exp. I 
Exp. Il 


24 hr 
Exp. III 


48 hr. 

For all conditions the learning before the re- 
tention interval was carried to 8 out of 10 correct 
responses on a single trial. After the retention 
interval, relearning was carried to one errorless 
trial by massed practice. 

For each experiment, 18 Ss were used. The 
three conditions of distribution were completely 
counterbalanced among these 18 Ss, with lists 
held constant from day to day. Since the 
original learning was under identical conditions 
for Exp. I, I, and III, and for Exp. IV, V, and 
VI, 54 Ss are available for determining the in- 
fluence of distribution on learning the lists of low 
and high similarity. For the retention measure- 
ments, however, only 18 Ss are available for a 
given condition of distribution, similarity, and 
retention interval. 

All Ss were given one practice day during 
which they learned a single list of paired-associ- 
ates, partly by massing (4 sec. between trials), 
and partly by distribution (30 sec. between 
trials). This list was recalled after the same 
period used for that given experiment on the ex- 
perimental days. During learning with 4 sec. 
between trials a single blank space appeared on 
the drum after each trial. For distribution at 
30 sec. and 2 min., S named colors from a color 
board at the rate of one per sec. as timed by a 
metronome. In Exp. I and IV, in which the 
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retention interval was 10 min., Ss were kept in 
the laboratory and practiced operating a motor 
division-of-attention test. 

At recall all Ss were provided explicit instruc- 
tions to give as many responses as possible on 
the first trial and to continue with the list until 
they could say it perfectly. Although S was 
allowed sufficient time to set himself for the re- 
call, no formal warm-up task was given (7). 
Relearning in all conditions was by massed 
practice. In no case did S learn a new list on a 
day that he relearned a previous one. 


RESULTS 


Learning as a function of distribution 
and intra-task similarity.—The influ- 
ence of these two variables on rate of 
learning the lists to a criterion of eight 
correct responses is given in Table I. 
The results for each experiment are 
presented separately to demonstrate 
the consistency of the results from ex- 
periment to experiment. 

The two major facts shown by the 
data in Table I are: (a) The high- 
similarity lists took much longer to 
learn than did the low-similarity lists; 
(6) No significant difference is present 
as a function of degree of distribution 
for the low-similarity lists nor for the 
high-similarity lists. 

It is conceivable that while the num- 
ber of trials required to reach a cri- 
terion of eight correct responses might 
not differ as a function of the degree 
of distribution, the rate of learning 
might change as a function of stage 
of learning. The learning curves for 
the low- and high-similarity lists for 
the three degrees of distribution have 
been plotted, and it is evident that 
little difference in the shapes of the 
curves exists. 

Errors during learning.—Some evi- 
dence was presented in the previous 
paper in this series to the effect that if 
distributed practice facilitates learn- 
ing, more errors will be present with 
distribution than with massing (11). 
In the present experiments it has been 
found that distribution does not facili- 
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TABLE I 


Tue Inrivence or Distrisputep Practice AND InTRA-Task SIMILARITY ON THE 
LEARNING OF Patrep-AssociaTE Lists 








Time Between Trials 





Type of List 


30 sec. 








Low Similarity 














16.78 
15.22 
14.33 
15.44 














tate learning. It may therefore be 
expected that the conditions—for a 
given level of similarity—will not 
demonstrate error differences. The 


data are presented in Table II in 
terms of mean number of errors per 


trial. To obtain these values the 
number of errors made by each S for 
a given condition was divided by the 
number of trials required to learn. 
The mean of these values for 54 Ss are 
the values given in Table II. 

Table II shows that the frequency 
of errors was greater for the high-simi- 
larity lists than for the low, but that 
for a given degree of similarity the 
error frequency does not vary signifi- 
cantly for different degrees of dis- 


TABLE Il 


Mean Errors Per Triat Durinc LEARNING 








Degree of Similarity 




















tribution. The largest difference, 
that between 4-sec. distribution and 
30- or 60-sec. distribution for high 
similarity, does not even approach 
statistical significance. While the er- 
ror curves as a function of stage of 
learning will not be presented it can 
be reported that the shapes of these 
curves did not vary as a function of 
degree of distribution. 

Recall.—Retention as measured by 
recall will be presented in terms of 
mean number of items forgotten. 
The values were determined for each 
S by subtracting the number of items 
recalled from the number of items 
given correctly on the last original 
learning trial. The means of these 
values for each condition are presented 
in Fig. 1 and Table III. 

Looking first at the data for the low- 
similarity lists it is to be noted that, 
in general, forgetting is less following 
learning by distributed practice than 
following massed (4 sec.). At the 
24-hr. retention point both of the 
distributed conditions show signifi- 
cantly less loss than the 4-sec. condi- 
tion. The difference between the 4- 
and 30-sec. conditions is 1.22+.51 
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tN w S uw o 


MEAN NUMBER ITEMS FORGOTTEN 


LOW SIMILARITY 





1 


iL 


tT T 
TIME BETWEEN TRIALS 

4 SEC. 

coceees -30 SEC 

—-—--—2 MIN. 


HIGH SIMILARITY 











10 MIN. 24 HR 


48 HR. 10 MIN 
RETENTION 


1 
24 HR 


INTERVAL 


Fic. 1. Forgetting of paired adjectives as a function of intra-list similarity, length of 
interval between trials, and length of retention interval 


items (t = 2.39). The difference be- 
tween the 4-sec. and 2-min. conditions 
is 1.45+.39 (¢ = 3.72). For 17 df a 
t of 2.11 is significant at the .05 level 
of confidence and a t of 2.90 is neces- 
sary for the .01 level. After 48 hr. 
only the 2-min. condition of distribu- 


TABLE III 


Mean Numser oF Items ForGOTTEN AS A 
Function or Time, Intrra-List 
SIMILARITY, AND DiIsTRIBUTION 

OF PRACTICE 








| Time Between Trials 





Interval 2 min. 





M 


ce Similarity 
10 min 
24 hr. 
48 hr. 
High Simi 
10 min. 
24 hr. 

















tion yields better retention than the 
massed condition. 

In contrast to the low-similarity 
lists, the high-similarity lists show 
little if any facilitation in retention 
following distribution. None of the 
differences between the massed and 
distributed conditions is significant. 
With the 2-min. distribution retention 
is consistently better than the massed 
condition, but clearly, as compared 
with the low-similarity lists, distribu- 
tion of practice facilitates retention 
but little. 

A third set of facts revealed is that 
high-similarity lists are retained some- 
what better than low-similarity lists. 
According to Gibson’s theory (2), 
forgetting should be greater the 
greater the amount of generalization 
within the list. Table III and Fig. 2 
show this not to be the case. In Gib- 
son’s own experiment (3) it was found 
that high-similarity and low-similarity 
lists were retained about equally well 
after 24hr. Her paired-associate lists 
had geometric forms for stimuli and 
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nonsense syllables as responses. In 
defense of her theory Gibson argues 
that the reason retention was unaf- 
fected by intra-list stimulus similarity 
was because the items of high-similar- 
ity lists had more repetitions during 
learning than did those of low-similar- 
ity lists (3, p. 189). In short, this im- 
plies that with equal number of rein- 
forcements during original learning, 
retention will be better (forgetting less) 
for low-similarity lists than for high- 
similarity lists. This expectation will 
be tested next. 

Item analysis of recall scores.—To 
make this test of the Gibson hypothe- 
sis the number of correct responses 
for each item in the original learning 
was tallied. For each item, “what 
happened” on the recall trial was also 
noted. If Gibson is correct, items 
appearing in a_ high-similarity list 
and having been given, say, three 
times correctly during original learn- 
ing, should show greater forgetting 
than an item given correctly three 
times but appearing in a low-similarity 


TABLE IV 


First-Triat Recaut as A FUNCTION OF 
SIMILARITY WHEN NuMBER oF CoRRECT 
RESPONSES DURING ORIGINAL 
Learninc 1s Hetp Constant 








Low Similarity High Similarity 


No. Correct 
during OL 





% 
Correct 





9.1 
16.2 
31.4 
43.8 
39.4 
56.1 
52.8 
72.5 
65.9 
9 $ 58.9 
10 : 74.3 
11 and over . 


SNAWMS WN © 





Total 

















TABLE V 


Tota, NumBer or Errors on 
First Recatt TRIAL 








Time between 
Learning Trials 
Interval 





4 sec. | 30 sec. 
Low Similarity 
0 min. 15 10 
24 hr. 16 
48 hr. 12 





High Similarity 
10 min. 28 
24 hr. 22 
48 hr. 28 

















list. This analysis was made for each 
condition of each experiment, but the 
consistency from condition to condi- 
tion permits the data to be presented 
for all conditions of low similarity as 
opposed to all conditions of high 
similarity. Table IV gives the es- 
sential information. The per cent 
correct at recall and the per cent error 
at recall are plotted in Fig. 2. 

In terms of per cent correct at recall 
following a constant number of rein- 
forcements (correct responses) during 
original learning, there is little evi- 
dence to support the Gibson theory. 
When equated for number of correct 
responses during original learning low- 
and high-similarity lists are recalled 
about equally well. The fact that 
Table III showed retention to be 
somewhat better for the high- than 
for the low-similarity lists is probably 
to be accounted for by the high rein- 
forcement (7, 8, 9, 10, 11 and over) 
given the high-similarity lists as com- 
pared with the low-similarity lists. 

With respect to the percentage 
error for each level of reinforcement, 
two facts are clear: (a) as degree of 
learning increases, number of errors 
decreases, and (b) more errors are 
made in recalling the high-similarity 
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T T 


LOW SIMILARITY 
HIGH SIMILARITY 


PERCENT OCCURRENCE AT RECALL 





CORRECT RESPONSES 


ERRORS 
0. 








NUMBER CORRECT RESPONSES 


10 il-OVER 
IN ORIGINAL LEARNING 


Fic. 2. Correct responses and errors at recall for low- and high-similarity lists when number of 
correct responses during original learning is held constant (all retention intervals combined) 


lists than in recalling the low-simi- 
larity lists. Thus, while there are 
more errors made in recalling the high- 
similarity lists—as would be predicted 
by the Gibson theory—these errors 
apparently do not reduce the effective- 
ness of the recall of correct responses. 
The examination of errors leads to 
another test of Gibson’s theory. If 
generalization recovers as a function 
of time, its recovery should be indexed 
by an increase in the number of errors 
as the retention interval increases. 
Data for evaluating this deduction 
are given in Table V. 

The data of Table V do not confirm 
the idea that errors increase as a func- 
tion of time since learning. On the 
basis of these data it is more correct 
to say that errors remain roughly con- 
stant as a function of passage of time. 


The variation in total errors exhibits 
no regular trend. 

Relearning.—The Ss relearned the 
list under each condition to a criterion 
of one perfect trial. These data will 
not be presented. In general, more 
trials to relearn were required as the 
retention interval increased. Re- 
learning was always faster following 
distributed practice than following 
massed practice, but in no case were 
the differences significant. Errors per 
trial were greater for the high- than 
for the low-similarity lists. 


Discussion 


The intervals of distributed prac- 
tice used here failed to facilitate the 
learning of paired-associate lists of 
either high or low similarity. And, 
although the error frequency was 
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greater for lists of similar items than 
for dissimilar items, no difference in 
frequency of errors was present as a 
function of degree of distribution of 
practice. Thus, as in the first study 
in this series (11), facilitation by dis- 
tributed practice was not found when 
the error frequency was the same for 
massed and distributed practice. 

Considering past studies as a whole, 
it may be said that distributed prac- 
tice has been found to benefit learning 
when difficulty is high. Difficulty 
may be increased by stepping up the 
presentation rate, by lengthening the 
list, or by increasing intra-task simi- 
larity. It was expected that the 
high-similarity lists used in the pres- 
ent experiments would result in in- 
creased rate of learning under dis- 
tribution. But, even though the task 
was difficult—several Ss had to be 
dropped because they could not learn 
the lists—no difference in learning 
under massing and distribution was 
observed. Increased rate of presenta- 
tion of the high-similarity lists might 
accomplish the objective. Hovland 
(5) has recently shown that if speed 
of presentation of paired nonsense syl- 
lables was increased from a 2-sec. rate 
to a l-sec. rate, distribution of trials 
benefited learning. Nevertheless, the 
fact remains that under the conditions 
of the present experiments, no effects 
of distributed practice in learning 
were observed. 

When the experiments were planned 
it was believed that distributed prac- 
tice would allow for successive extinc- 
tion of error tendencies. A_ rest 
interval, it was presumed, would allow 
recovery of intra-list generalization 
tendencies. On the next trial these 
tendencies would be differentiated or 
extinguished again. With the next 
rest interval the tendencies might re- 
cover but would recover less than pre- 
viously; hence, they would be extin- 
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guished more rapidly on the next trial, 
etc. The net effect would be very 
slow forgetting following distributed 
practice because the error tendencies, 
having had repeated extinctions, 
would recover very slowly. There 
are at least three sets of facts which 
make this hypothesis untenable. (a) 
Retention of high-similarity lists 
should benefit more by distributed 
practice than the low-similarity lists 
because of the greater number of gen- 
eralization tendencies. The reverse 
of this was found. (b) Errors during 
learning should be greater for dis- 
tributed than for massed practice, at 
least during the early stages of learn- 
ing. The data on errors as a function 
of stage of learning were not presented, 
but it can be categorically stated that 
they did not vary as a function of 
interval between trials. (c) Finally, 
if the hypothesis is tenable, there 
should be fewer errors at recall follow- 
ing distribution than following mass- 
ing. This was not the case; indeed, 
there tended to be a few more errors 
following distribution (see Table V). 
In short, the idea of successive ex- 
tinctions to account for superiority 
of retention following distributed prac- 
tice must be rejected on the basis of 
the data. Other explanations will be 
considered. 

A theory which postulates a form of 
reactive inhibition which accrues dur- 
ing massed learning and not during 
distribution cannot account for the 
better retention following distribution. 
Such a theory would predict better 
retention following massed practice 
than following distribution. The 
Yale theory of rote learning (6)reached 
an impasse on this matter. Likewise, 
McClelland’s performance-inhibition 
theory (9), since it is similar to the 
Yale theory on this point, would not 
be able to account for the retention 
data of the present experiments. 
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Another possibility might be stated 
in terms of practice at recalling. 
Cain and Willey (1) offer this as one 
possible explanation for better reten- 
tion following distribution. The S, 
having learned by distribution, has 
considerable practice in recalling dur- 
inglearning. He would not have such 
practice during massing. ‘To account 
for the present data such a theory 
would have to be stated in terms of 
specific lists; that is, it would have to 
state that S had specific practice in 
recalling a given list since all Ss had a 
great deal of practice in recalling fol- 
lowing rest intervals. Yet, even with 
this added assumption, there still re- 
mains the problem of explaining why 
distribution was not effective in the 
recall of the high-similarity lists. 
The idea of “practice in recalling” 
does not seem to bea fruitful approach. 

At the present time no satisfactory 
theory to account for the retention re- 
sults as a function of distribution can 
be advanced. 


Our results have been interpreted as 
being contrary to the expectations of 


Gibson’s theory (2). Three findings 
may be so interpreted: (a) equally 
good retention of items in high- and 
low-similarity lists when equated for 
degree of original learning; (b) failure 
of errors to increase as a function of 
retention interval, and (c) the failure 
of the greater number of errors at re- 
call for high-similarity lists (as com- 
pared with low-similarity lists) to be 
correlated with greater forgetting. 
In fairness to Gibson it should be 
pointed out that her theory is stated 
only in terms of stimulus generaliza- 
tion. The high-similarity lists used 
here were constructed so as to produce 
stimulus generalization, response gen- 
eralization, and stimulus-response gen- 
eralization. It is conceivable that the 
construction of these lists was such 
that other forms of generalization re- 
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sulted in processes which masked or 
nullified the operation of stimulus 
generalization as postulated in the 
Gibson theory. This does not seem 
too likely, however, in view of the 
recent findings that response general- 
ization seems to have the same prop- 
erties as stimulus generalization (8, 
10, 12). Nevertheless, before reject- 
ing the Gibson postulates concerning 
retention they should be tested under 
the conditions she had in mind when 
formulating the theory. We have 
found no confirming evidence and 
some which may be only apparently 
contrary to the theory. 

We have said that errors did not in- 
crease as a function of the retention 
interval as predicted by Gibson’s 
theory. Although Gibson has not 
suggested it, if we took the ratio of 
correct responses to errors it would be 
shown that relative to correct re- 
sponses, Overt errors increase as a 
function of the retention interval. 
Gibson’s theory does not, of course, 
say that errors should increase in fre- 
quency indefinitely. At some point 
they will start falling in frequency. 
No evidence of this fall is present with 
the intervals used here. 

With regard to the observation 
that retention of low- and high-simi- 
larity lists was equally good, it might 
be argued that a longer retention inter- 
val would eventually have produced 
greater forgetting for the high-simi- 
larity lists. However, the only trend 
noticeable was in the opposite direc- 
tion; the low-similarity lists tended 
to be retained relatively better after 
the short interval than after the 48-hr. 
period. 

SUMMARY 


These experiments were designed to 
deterraine the influence of (a) dis- 
tributed practice, and (b) intra-list 
similarity on learning and retention 
of lists of ten paired adjectives. The 
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three degrees of distribution were 4 
sec., 30 sec., and 2 min. between each 
learning trial. There were two de- 
grees of intra-list similarity, one very 
high and one low. The three reten- 
tion intervals were 10 min., 24 hr., 
and 48 hr. The 18 conditions were 
run as six experiments of three condi- 
tions each, with 18 Ss in each counter- 
balanced experiment. The basic re- 
sults follow: 


1. There were no differences in rate 
of learning to the criterion of 8 out of 
10 correct responses as a function of 
length of interval between trials. 

2. The high-similarity lists took 
many more trials to learn than did the 
low-similarity lists. More errors per 
trial were also evident. 

3. Distributed practice facilitated 
retention for the low-similarity lists, 
but not for the high-similarity lists. 

4. The frequency of errors at recall 
showed no relation to the length of the 
retention interval. 


5. When equated for degree of orig- 
inal learning, items in the low- and 


high-similarity lists retained 
equally well. 

6. The applicability of several the- 
ories of verbal learning and retention 
to these results was evaluated. These 
theories included the reactive inhibi- 
tion-type theory of Hull and his co- 
workers; McClelland’s performance- 
inhibition theory, and Gibson’s gen- 
eralization hypothesis. None of these 
theories was found capable of handling 
the basic findings of the present study. 


were 
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THE EFFECT OF MOTIVATION AND AMOUNT OF 
PRE-REST PRACTICE UPON INHIBITORY 
POTENTIAL IN MOTOR LEARNING! 


BY HILTON N. WASSERMAN 
George Fry and Associates? 


The immediate background of the 
present experiment may be best under- 
stood by referring to Kimble’s (7) 
summary and extension of Hull’s per- 
formance inhibition theory (3). Ac- 
cording to this theory, the total in- 
hibitory potential is made up of two 
elementary decremental components, 
reactive inhibition (Jr) and condi- 
tioned inhibition (s/r). These two 
factors are alike in that they both de- 
press the habit being learned. They 
differ, however, in many ways. 

Reactive inhibition is a drive state 
closely allied to pain avoidance. It 
is response-produced inhibition which 
results from or accompanies all effort- 
ful behavior -whether reinforced or 


not, and dissipates during a rest inter- 


val. If the rest intervals are not long 
enough to allow for the dissipation of 
a good deal of the Jr (as in massed 
practice), Jr» accumulates and the 
need for rest increases. When resting 
occurs, Jp dissipates and this serves 
as a drive reduction or reinforcement. 

Because of the reinforcement caused 
by rest, the response of resting will 


1 This paper constitutes a portion of a dis- 
sertation submitted in partial fulfillment of the 
requirements for the degree of Doctor of Philos- 
ophy, in the Department of Psychology in the 
New York University Graduate School of Arts 
and Science. The writer wishes to acknowledge 
with sincere thanks the unfailing aid and advice 
provided by Professor Leland W. Crafts through- 
out the progress of this investigation. The 
writer is also grateful to Professor Howard H. 
Kendler for his valuable suggestions, and to 
Professor Thomas N. Jenkins for his suggestions 
concerning the statistical treatment of the data. 

2 This study was completed while the author 
was at New York University. 


become a habit which will be condi- 
tioned to whatever stimuli are pres- 
ent in the learning situation. This 
conditioned resting tendency is s/p. 

The two inhibitory components 
are, then, a drive (Jz) and a habit 
(slz), and each must have the char- 
acteristics, respectively, of other 
drives and habits. Thus, s/z should 
develop as a positive growth function 
of the number of reinforcements, while 
Ip might be expected to accumulate as 
some increasing function of the 
amount of effort expended. Further, 
conditioned inhibition, being a habit, 
should show almost no diminution 
with the passage of time, while /p, 
being a drive, will be reducible by the 
appropriate goal response, resting.’ 
This latter difference makes it possi- 
ble (as has been done by Kimble, 7, 
p. 18) to trace the separate course of 
development of each of these two 
inhibitory components. 

Kimble (7) hypothesizes that with 
continuous practice (work), since Jz 
is a drive, the accumulation of a cer- 
tain critical amount will automatic- 
ally produce resting, thus resulting in 
dissipation of Jz before a formal rest 
interval is introduced. If motivation 
to perform the task continues, once 
Ip is reduced below the critical level, 
the organism will resume work and 
continue until the critical level of Je 
is reached again. Hence, a sort of 


3It should be pointed out, however, that 
because g/g is a habit, it should be affected by 
retroactive inhibition. Hence, what happens 
during an interval of time affects s/pz, while the 
mere passage of time (resting) after the cessation 
of the response diminishes Jp. 
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equilibrium will be reached in which 
the organism rests long enough to 
keep Jz at, or slightly below, some 
specific level. It is to be expected 
that whenever motivation decreases 
in learning, this level will drop some- 
what. 

Kimble further hypothesizes that 
some threshold value of /z is necessary 
before the process of development of 
slz can begin. Thus, early in learn- 
ing, when Jz has not yet reached this 
threshold value, no slr should be 
present. 

The results of Kimble’s experiment 
(7) support these two hypotheses and 
suggest a third: Even though the 
curve of development of s/z tends 
toward negative acceleration (as is 
expected for a habit), Kimble sug- 
gests that the reaching of the asympto- 
tic value will be postponed because 
any condition which tends to reduce 
the tolerable (critical) amount of Jp 
will increase the amount of resting, 
and therefore, the rate of development 
of slr. One such condition tending 
to reduce the critical amount of re- 
active inhibition could be the decrease 
in motivation which occurs late in 
learning with certain activities. 

By utilizing two motivation condi- 
tions, each having a spaced practice 
control group, a massed practice con- 
trol group, and nine experimental 
groups working under conditions of 
massed practice except for the intro- 
* duction of a 10-min. rest after varying 
amounts of practice, the present ex- 
periment is designed to test the fol- 
lowing hypothesis and five predictions 
related to it. The hypothesis is that 
high motivation should raise the 
critical (tolerable) level of Jz neces- 
sary to produce the automatic resting 
response. The predictions based 
upon this hypothesis and the preced- 
ing discussion are as follows: 
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1. Since high motivation should 
raise the organism’s resistance to /,, 
i.e., permit the accumulation of a 
higher level of Jz before an automatic 
resting response occurs, and since this 
resistance might, in turn, involve a 
greater expenditure of effort, thus re- 
sulting in more Jp, it may be predicted 
that late in learning Jz should be less 
with a poorly motivated group than 
with a highly motivated group. 
Hence, a rest interval late in learning 
should be more beneficial (in terms of 
dissipation of Jz) for a highly moti- 
vated group than for a poorly moti- 
vated group. 

2. If the amount of improvement 
after rest (which is due to the dissipa- 
tion of Jpg during rest) becomes 
smaller the later in learning the rest 
interval is introduced, as Kimble (7) 
and others have found, this decrease 
in improvement after rest should be 
less for the highly motivated group 
than for the poorly motivated group. 

3. Early in learning, before the 
threshold value of Jr has been 
reached, no s/z should be present in 
either of the motivation conditions. 

4. Since with the highly motivated 
group there will be less resting late in 
learning, and therefore, less frequent 
reinforcement (need reduction), it 
might be expected that late in learning 
the rate of development of s/z should 
be less in the highly motivated group 
than in the poorly motivated group, 
i.e., the asymptote should be reached 
sooner with high motivation than with 
low motivation. 

5. The amount of gs/,» late in learn- 
ing should be less in the highly moti- 
vated group than in the doorly moti- 
vated group. This prediction follows 
from the fact that even though less 
Tr will accumulate in the poorly moti- 
vated group, the need reduction (rein- 
forcement of s/p) produced by resting 
will be more frequent in this group 
than in the highly motivated group. 
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METHOD 


A total of 22 groups of Ss was utilized, each 
group being a class in general psychology at 
Washington Square College of New York Uni- 
versity. The experiment was administered to 
one group at a time, and a total of 538 Ss was 
used. 

The learning task was the alphabet printing 
task described by Ruch and Warren (12), and 
utilized by Kimble (7,8) and Kientzle (6). 
Briefly, the task requires S to print the letters 
of the alphabet in normal A to Z order, but up- 
side down and from right to left so that when the 
work sheet is rotated 180 degrees, the letters 
appear properly oriented. 

With the exception of the motivation vari- 
able, the procedure was essentially that used by 
Kimble (7). The Ss were divided into two 
motivation conditions, low and high. Each 
motivation condition was composed of 11 groups 
of Ss: a spaced control, a massed control, and 
nine experimental groups. Each of the experi- 
mental and control groups had 20-30 Ss. The 
nine experimental groups were given 50 massed 
30-sec. trials and a single 10-min. rest after trials 
2, 5, 10, 15, 20, 25, 30, 35, and 40, respectively. 
The control groups were given 50 30-sec. trials, 
the massed groups receiving no rest and the 
spaced groups receiving a 30-sec. rest after each 
trial. 


Each S was provided with five mimeographed 


work sheets and a pencil. The work sheets con- 
tained space for ten trials. To assure that all Ss 
printed letters of about the same height and 
width, the work sheet was set up in the form of 
cross section paper 4 X 4 to the inch so that each 
letter had to be printed in a }-in. square box. 
The five sheets were stapled together so that the 
Ss were required to turn a page after trials 10, 
20, 30, and 40. 

The task was explained and demonstrated to 
the Ss in both motivation conditions and they 
started working at the signal “go.” In each 
group except the spaced control groups the Ss 
were told after 29 sec. to go on to the next trial.‘ 
Since the Ss were required to turn a page after 
trials 10, 20, 30, and 40, a little time was lost 
on trials 11, 21,31, and 41. Hence, the scores on 
these four trials were discarded except in those 
groups which had rest after trials 10, 20, 30, and 
40. In the massed control groups the Ss were 
given a simple “stop” and “go” set of signals. 

Following Kientzle’s procedure (6), during 
the rest interval the Ss were told to remain “at 
ease” in their seats, and were asked not to dis- 
cuss the work with their neighbors. Also, fol- 
lowing Kientzle, the score was the number of 


4 See Kimble (7, p. 17) for the rationale of this 
timing of the trials. 


letters printed on each trial regardless of accur- 
acy. 

The difference in motivation was provided by 
the instructions given in the two conditions. 
The low motivation condition was task-oriented, 
the Ss being told that they were merely helping 
out with some preliminary experimentation to 
determine if the task could be adapted to a pro- 
posed future experiment. The high motivation 
condition was ego-oriented, the Ss being told 
that they were going to be given a new type of 
intelligence test the results of which would be 
reported to them and held strictly confidential. 


. These Ss were further told that the test was 


being given to the various psychology classes in 
order to set up average levels of performance. 

At the end of the experimental session the Ss 
in both motivation conditions were asked to 
indicate on the back of their work sheets any 
pencil difficulties or anything else that tet 
have interfered with their performance. 


REsuULTs AND Discussion 


Learning.—The basic data of the 
experiment are shown in Fig. 1 and 
2. The upper and lower curves show 
the performance of the spaced control 
groups and the massed control groups 
respectively. The intermediate 
curves show the post-rest perform- 
ance of the nine experimental groups 
(in each motivation condition) on the 
first ten trials following a 10-min. 
rest. The number on each of these 
curves is the number of the trial after 
which rest was given. The massed 
practice curves include the perform- 
ance of all the experimental groups 
before the introduction of the rest. 
The Ns for the various parts of the 
low-motivation massed practice curve 
(Fig. 1) are as follows: Trials 1-2, 242; 
Trials 3-5, 222; Trials 6-10, 199; 
Trials 12-15, 169; Trials 16-20, 141; 
Trials 22-25, 119; Trials 26-30, 94; 
Trials 32-35, 66; Trials 36-40, 46; 
Trials 42-50, 26. The comparable 
Ns for the various parts of the high- 
motivation massed practice curve 
(Fig. 2) are as follows: 245, 221, 196, 
168, 143, 116, 96, 75, 51, 30. 

It should be further noted that in 
plotting the massed practice curves 





INHIBITORY POTENTIAL IN MOTOR LEARNING 


Spaced Practice 


MEAN SCORES 








20 25 = 30 
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Fic. 1. Learning curves for the low motivation groups 


Practice 


Massed Practice 


MEAN SCORES 








TRIALS 


Fic. 2. Learning curves for the high motivation groups 


shown in Fig. 1 and 2, Trials 11, 21, utilized in drawing the curves because 
31, and 41 were omitted because they it was apparent that most of the Ss 
were shortened by the necessity of became confused in making the transi- 
turning pages of the work booklets. tion from Trial 1 to 2 and as a result 
In addition, the point for Trial 2 is did poorly on Trial 2 (as shown by 
shown on the graphs, but was not the point on the graphs). 
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It may be determined from Fig. 1 and 2 
whether the results bear out two basic assump- 
tions which the design of the experiment necessi- 
tated. First, following Kimble (7), it was as- 
sumed that the massed practice control groups 
worked under conditions of nearly maximal in- 
hibition, while inhibition was near a minimum 
for the spaced practice control groups. It is 
quite evident from Fig. 1 and 2 that the spaced 
groups are much superior to the massed groups, 
the difference being greatest on the last trial of 
the experiment in both motivation conditions. 
This finding would be expected if the accumula- 
tion of inhibition with massing produced the 
difference. Comparison of the present findings 
with those of Kimble (7) further support this 
assumption. The results of these two experi- 
ments are quite parallel in terms of performance 
scores with Kimble’s groups falling between the 
low and high motivation conditions of the present 
experiment. Thus, in the light of these com- 
parisons and the finding by Kientzle (6) that 
diminishing returns (in terms of amount of 
reminiscence) set in after 45 sec. of rest with 
1-min. trials, this first basic assumption seems 
quite justified. 

A second basic assumption inherent in the 
design of this experiment is that the performance 
of the high motivation groups would be superior 
to that of the low motivation groups. Prelimin- 
ary experimentation with several sets of in- 
structions revealed that it was not a matter of 
getting a maximum amount of motivation, but 
rather an optimum amount. It is quite evident 
from Fig. 1 and 2 that with the instructions 
which were finally utilized the performance of 
the high motivation groups was consistently 
superior to that of the low motivation groups. 
Further, these differences were significant at 
least at the .05 level of confidence, and in many 
cases beyond the .01 level of confidence. 


Reminiscence (Ip).—It may _ be 
noted from Fig. 1 and 2 that a 
striking feature of each of the experi- 
mental groups under both motivation 
conditions is a marked increase in per- 
formance after rest (reminiscence). 
Table I presents the mean increase 
for each of the 18 experimental groups 
along with the respective t-values of 
these increases. 

The occurrence of reminiscence 
provides a method of measuring re- 
active inhibition. It was here as- 
sumed (as other investigators have) 
that reminiscence is produced by the 


HILTON N. WASSERMAN 


TABLE I 
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* Significant at the .01 level of confidence. 
** Significant at the .001 level of confidence. 


dissipation of 7p during rest. Thus, 
if the rest interval is long enough to 
allow for the dissipation of all or most 
of the accumulated Jz, the magnitude 
of the increase after rest provides an 
indirect measure of the amount of 
Ip that was present before rest. 

The assumption that the 10-min. 
rest utilized in this experiment was 
ample to allow for nearly complete 
recovery from inhibition is supported 
by Kimble’s studies with the alphabet 
printing task (7,8) and also by the 
earlier pursuit rotor studies of Am- 
mons (2), Kimble and Horenstein 
(10), and Irion (5), all of which sug- 
gest that the maximal rate of dissipa- 
tion of reactive inhibition takes place 
during the first few minutes of rest 
with greatly diminished returns after 
about 5 min. of rest. 


One additional preliminary point remains to 
be dealt with before proceeding with the evalu- 
ation of the reminiscence data. This point con- 
cerns the statistical method used in dealing with 
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reminiscence in this experiment. Usually the 
reliability of the increase after rest is determined 
by comparing the increase in the experimental 
group with the “ normal” trial-to-trial increase 
which occurs in the massed control group with- 
out rest. In the present experiment, however, 
this comparison could be made only with those 
groups for which comparable massed practice 
scores were available. These scores were not 
available for Trials 11, 21, 31, and 41 because of 
the necessity of turning pages on these trials. 
Thus, the “normal” increase could not be com- 
puted for Groups 10, 20, 30, and 40. 

The substitute procedure which was used in 
this experiment (and apparently by Kimble [7]) 
assumed the pre-rest trials of the experimental 
groups to be comparable to the corresponding 
trials of the massed practice group. The null 
hypothesis was then set up that the introduction 
of a rest interval will produce no special increase 
in score, i.e., no increase other than the “normal” 
trial-to-trial increase. The p-values in Table I, 
of course, lead to the rejection of the null hy- 
pothesis. 

That this procedure is valid is confirmed by 
the p-values presented in Table II which are 
based upon the five comparisons with the massed 
control group which could be made since the 
turning of pages was not necessary at these 
points. Similar trends are indicated by both 
methods, there being only three points where 
the trends shown by Tables I and II are not in 
agreement. Furthermore, similar results were 
also gotten by use of the chi-square technique. 
Chi-square analyses confirmed the reliability of 
the increases after rest under both conditions of 
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** Significant at the .01 level of confidence. 

*** Significant at the .001 level of confidence. 
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motivation, and also the reliability of the differ- 
ences between the two motivation conditions 
(which will be discussed below). 


The results of the present experi- 
ment confirm and supplement Kim- 
ble’s (7) findings concerning the de- 
velopment of reactive inhibition. 
First, a major gap in Kimble’s experi- 
ment may be filled in by the present 
findings. The use of a group having 
rest after Trial 2 was designed speci- 
fically to test the hypothesis that Jr 
builds up extremely rapidly. The 
data presented in Table I lend support 
to this hypothesis. Also, Kimble 
found that the maximum amount of 
Ip was present after 7.5 min. of work, 
with 88.5 per cent of this maximum 
being present after 2.5 min. of work, 
and 99.4 per cent of it being present 
after 5 min. of work. The results of 
the present experiment show that 
with low motivation the maximum 
amount of Jz was reached after 5 min. 
of work, with 93.4 per cent of this 
maximum being present after only 1 
min. of work. With high motivation 
the maximum was reached after only 
1 min. of work, and the level of Jp 
was maintained throughout learning 
at about 70 per cent of this maximum. 


The percentages of the present experiment 
are probably somewhat inflated by a complicat- 


ing factor. It was noted above that there was a 
decrease in score on Trial 2 due to confusion on 
the part of the Ss in making the transition from 
Trial 1 to 2. Since the difference between the 
last pre-rest trial and the first post-rest trial was 
the measure of reminiscence used, the actual 
amount of reminiscence in the two groups having 
rest after Trial 2 was probably less than the value 
shown in Table I. That is, part of the difference 
was due to an increase after rest produced by 
the dissipation of Jp, but also part of it must have 
been due to the prior decrease in score on Trial 2. 
This inflation of the measure may be dealt with 
by using the score on Trial 1 rather than the 
(lower) score on Trial 2 to calculate the increase 
after rest in these two groups. When this is 
done, the amount of reminiscence for Group 2 
with low motivation becomes 6.15 instead of 
7.10, and with high motivation 6.59 instead of 
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7.29. It is obvious that the indicated trend is 
but little changed by this correction (actually 
overcorrection). With low motivation 80.9 per 
cent of the maximum amount of J/g was present 
after only 1 min. of work, while with high motiva- 
tion the maximum was still reached after only 1 
min. of work, with the level of Zp being main- 
tained throughout learning at about 77.5 per 
cent of this maximum. 


Regardless of the exact point at 
which the maximum [pz was reached, 
it seems justified to conclude that in 
the present study Jr was built up 
extremely rapidly. These findings 
are in substantial agreement with 
those of Ammons (2) who found with 
pursuit learning that 3 min. of work 
produced almost as much reminis- 
cence as the maximum amount which 
was reached after 8 min. of work. 

On the basis of our main hypothesis 
two major predictions concerning Jp 
were made. The first prediction was 
that late in learning there should be 
less Jz with low motivation than with 
high motivation. This prediction is 
confirmed by the data presented in 
It may be seen that the 


Table III. 
differences between the two motiva- 
tion conditions are not of any great 
magnitude and are not statistically 
significant (with one exception) until 
late in learning (after 30 pre-rest 


trials). Hence, the inference that 
high motivation should result in less 
resting during learning and thus, more 
Tp late in learning is clearly substanti- 
ated by the results of this experiment. 
Apparently, the poorly motivated Ss 
more frequently went into an auto- 
matic resting response resulting in 
the dissipation of Jz during learning, 
while the highly motivated Ss resisted 
this tendency to a greater extent. 
These findings are in agreement 
with those of Kimble (9) who used the 
pursuit rotor and a man-to-man com- 
petitive situation to obtain two levels 
of motivation. He hypothesized that 
Ss who are unsuccessful in the com- 
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TABLE III 


AMOUNT BY WHICH THE MEAN INCREASE IN 
Score AFTER Rest witu Hicu Motivation 
Exceeps THAT wity Low MorivaTIon 








| 
Mean | 
Difference 


Last Trial 
before Rest 











* Significant at the .05 level of confidence. 
** Significant at the .01 level of confidence. 


petitive situation are more highly 
motivated to do well in the learning 
task than successful Ss. He then 
made the prediction that the unsuc- 
cessful Ss will tolerate more reactive 
inhibition (will show more reminis- 
cence after a rest interval) than the 
successful Ss because the former are 
more highly motivated. His results 
confirmed this prediction, the overall 
difference in reminiscance being statis- 
tically significant at the .02 level of 
confidence. 

Closely related to these findings is 
the second prediction concerning /p. 
Since the amount of reminiscence 
tends to decrease as the rest interval 
is introduced progressively later in 
learning, it was predicted that this 
decrease should be less with high 
motivation than with low motivation. 
The results shown in Tables I and II 
confirm this prediction. It may be 
seen in Table I that with high motiva- 
tion, after an initial drop (from Group 
2 to 5), the level of Jp fluctuates up 
and down a little, maintaining a 
fairly constant level. However, with 
low motivation there is a definite 
tendency for the level of Iz to decrease 
from the maximum point as the 
amount of pre-rest practice increases. 
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The partial results shown in Table II 
also support the prediction, although 
not asclearly. Late in learning (from 
25 to 35 pre-rest trials) with low moti- 
vation there is a decrease in Jp of 
3.69, while with high motivation there 
is a decrease of only .46. 


An additional point concerning the warm- 
ing-up effect may be noted. As discussed by 
Ammons (1,2) and Irion (5), the magnitude of 
the warming-up effect is indicated by the amount 
by which the second post-rest trial exceeds the 
first. Using this method of measurement, 
Kimble (7) notes that his data fail to exhibit the 
warm-up phenomenon which Ammons (2) 
demonstrated with the pursuit rotor. That is, 
Kimble’s post-rest performance curves show a 
drop on the second post-rest trial, and, therefore, 
no warming-up effect is exhibited. This is 
especially interesting since the results of the 
present experiment, using the alphabet printing 
task as did Kimble, also show a rather sudden 
drop on the second post-rest trial. This is 
clearly evident in Fig. 1 and 2. 

It seems possible that Kimble’s data and 
those of the present experiment failed to exhibit 
any decrement due to need for warm-up because 
this decremental influence was greatly out- 
weighed by the beneficial effect of the dissipation 
of Ig during rest. It is also possible that in 
these experiments the warming-up effect was 
not of considerable magnitude (especially with 
the smaller amounts of pre-rest practice) since in 
both only a 10-min. rest was used, while Ammons 
found the maximum warm-up decrement after 
20 min. of rest in pursuit learning. 

If this explanation is correct, the increase on 
the first trial after rest must have been dimin- 
ished somewhat by the opposite influence of the 
need for warming-up. This inference has been 
tested by Irion (4), with nonsense syllables, by 
introducing varying amounts of warming-up 
activity just before the resumption of practice 
afterarest. He found that warming-up activity 
was beneficial up to a point, but then apparently 
produced additional reactive inhibition which 
diminished recall. With these results in mind, 
the introduction of varying amounts of warm- 
ing-up activity just before the resumption of 
work should result in an increase in reminiscence 
up to a point, and «hen a decrease as the amount 
of warming-up activity increases. 

Results relevant to this point are provided by 
one of Kientzle’s groups (6) in which ten 1-min. 
trials on the alphabet printing task were sepa- 
rated by 1-min. rest periods. This group was 
unique in that the entire rest interval was spent 
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in making tallies and cross bars on a separate 
sheet of paper. From the above discussion, it 
would be expected that with a l-min. work period 
and a l-min. rest the optimal amount of warm- 
ing-up activity would be very small, and that 
1 min. of warming-up activity would probably 
result in a decrement rather than facilitation. 
This group in Kientzle’s study was inferior in 
learning to another group differing from it only 
in that the rest intervals were spent in resting. 

A recent study by Silver (13) was designed 
specifically to investigate this problem. He 
utilized the alphabet printing task and intro- 
duced varying amounts (0, .5, 1, 1.5, 2, and 2.5 
min.) of massed and distributed warming-up 
activity during a 10-min. rest following five, 
fifteen and thirty 30-sec. trials. The warming-up 
activity was that used by Kientzle. He found 
that Ss who received warming-up activity were 
superior on the first post-rest trial to those who 
had rest with no warming-up activity. In addi- 
tion, with increasing amounts of massed warm- 
ing-up activity, reminiscence increased up to a 
point and then declined. On the other hand, 
with increasing amounts of distributed warming- 
up activity, reminiscence progressively increased. 

These findings are in agreement with the 
inferences stated above, and serve to clarify, at 
least partially, the relationship between reminis- 
cence and warming-up effect. 


Permanent decrement (sIz).—The 
previous discussion has noted some 
of the problems inherent in the use 
of reminiscence as a measure of /,z. 
Additional problems are encountered 
in the measurement of g/p. 


Kimble (7) has advanced one method of 
measuring s/z which makes use of the assump- 
tion that Jp, being a drive, dissipates during 
rest, while s/p, being a habit, does not dissipate 
with time. Thus, whatever decrement remains 
after a rest interval sufficient to dissipate Jp 
completely, should be due to s/z, and the meas- 
urement of this “permanent” decrement may be 
used as an indirect measure of sl/r. Further, 
since in Kimble’s experiment (and in the present 
one) the spaced practice group is assumed to be 
subject to a minimum of inhibition, the amount 
by which the performance of the spaced group 
exceeds that of the experimental groups on each 
initial post-rest trial should give an indication of 
the amount of permanent decrement, or slp, 
that was present in the experimental groups. 
Kimble’s procedure was to subtract the score 
on the first post-rest trial of each experimental 
group from the score of the spaced group on the 
trial of the same ordinal number. 
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However, part of the superiority of the 
spaced group may not be due to the decremental 
effects of s[g on the experimental group, but 
rather to the beneficial effects of a greater num- 
ber of reinforcements (greater habit strength) in 
the spaced group. This superiority of spaced 
practice is indicated by the higher level of per- 
formance attained on each trial, and becomes 
greater as practice progresses. Thus, the spaced 
group, at almost every point in the learning 
process, has printed a greater total number of 
letters than have the experimental groups (which 
have massed practice except for the single rest 
interval). It follows that the spaced group has 
received more frequent reinforcement, and 
therefore, habit strength should be greater in 
the spaced group than in the experimental 
groups at every trial except the first (3). There- 
fore, any difference between the initial post-rest 
performance of the experimental group and the 
spaced group on the trial of the same ordinal 
number confounds greater habit strength (sHp) 
in the spaced group and conditioned inhibition 
(sZg) in the experimental group. 

In view of this criticism of Kimble’s method 
of measuring slr, we have used, in addition to 
this method, a modification termed the cumula- 
tive mean method, which attempts to take into 
account the factor of differential number of rein- 
forcements by comparing the experimental and 
spaced groups on trials following similar numbers 
of responses (letters printed) rather than on 
trials of the same ordinal number. Although 
the matching of the cumulative means could 
only be approximate, the results are consistent 
and clear-cut. 

It may be seen from Table IV that the two 
methods suggest very similar trends, the main 
difference between the results of the two being 
that the values of s/z obtained by the cumula- 
tive mean method are consistently smaller than 
those obtained by Kimble’s method. It seems 
probable, then, that Kimble’s measure is being 
contaminated somewhat by some additional 
factor (perhaps habit strength). However, since 
only the absolute magnitude of the values is 
affected, while the relationships between the 
values remain approximately the same, the 
validity of Kimble’s conclusions concerning 
slp is not affected.§ 


Three specific predictions concern- 
ing the development of s/z have been 


5 It is interesting to note that the application 
of the cumulative mean method to the results of 
another study reported by Kimble (8) does not 
change the theoretical significance of the results. 
It merely reduces the absolute magnitude of 
slp from 4.6 to 3.7. 
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TABLE IV 


AMOUNT BY WHICH THE SPACED Group 
EXxcEEDS THE REMINISCENCE PoINT 
AFTER Nine AMOUNTS OF 
Pre-Rest Practice 
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* Significant at the .05 level. 
** Significant at the .01 level. 
*** Significant at the .001 level. 


made. According to the first, before 
the threshold value of Jz has been 
reached, no slr should be present, 
i.e., no slr should be present very 
early in learning. Further, since 
high motivation results in raising the 
critical level of Jz, and thus, less rest- 
ing during learning, it is expected that 
the initial appearance of s/e occurs 
somewhat later with high motivation 
than with low motivation. It may be 
seen from Fig. 1 and 2, and from 
Table IV, that the results of this 
experiment confirmed every aspect of 
this prediction. 

In Kimble’s experiment (7), slr 
first appeared after ten pre-rest trials. 
If the motivation in Kimble’s experi- 
ment is assumed to be somewhat 
greater than that of the low motiva- 
tion groups of the present experiment 
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(and the performance scores seem to 
bear out this assumption), the appear- 
ance of s/rg somewhat later in Kim- 
ble’s groups than in the low motiva- 
tion groups of the present experiment 
also fits the prediction. So likewise 
do the results of Kimble’s more recent 
study if it is assumed that the man-to- 
man competitive situation which he 
utilized resulted in a higher level of 
motivation than that attained in 
either his earlier experiments with the 
alphabet printing task or in both 
motivation conditions of the present 
experiment. Using 5-min. practice 
periods with the pursuit rotor and 
6-min. rest intervals, he failed to find 
evidence of conditioned inhibition. 
With high motivation, the Ss were 
able to resist the resting tendency 
produced by Jp for the full 5 min. of 
work so that no resting occurred dur- 
ing learning and all the Jr could be 
dissipated during the 6-min. rest. 
Thus, the threshold of Jr necessary 


TABLE V 


Mean DIFFERENCE BETWEEN THE LoW AND 
Hicu. Motivation ConpDITIONS IN THE 
AMOUNT BY WHICH THE SpaceD Group 
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* Group 2 is omitted because the post-rest perform- 
ance of this group in both motivation conditions was 
superior to the performance of the s group. 

These t-values are based upon the estimate of error 
variance obtained from an analysis of variance of the 
score differences presented in Table IV. 

¢ The minus sign indicates that the value for the high 
motivation group was greater than that for the low 
motivation group. 

4 Significant at the .001 level of confidence. 
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before slr can develop was never 
reached with this high degree of 
motivation. 

The second prediction stated that 
late in learning the rate of develop- 
ment of s/z should be less with high 
motivation than with low motivation. 
Reference to Table IV indicates that 
this prediction was also confirmed by 
the results. It may be seen that, re- 
gardless of the method of measure- 
ment, the increases in the amount of 
slr level off after 25 pre-rest trials 
with high motivation, while this 
starts to occur only after 30 pre-rest 
trials with low motivation. In gen- 
eral, these results agree with those of 
Kimble (7) who found that the asymp- 
tote had not yet been reached after 
30 pre-rest trials. 

The third prediction stated that 
late in learning the amount of gle 
should be less with high motivation 
than with low motivation. Again, 
the results presented in Tables IV and 


V clearly confirm this prediction. 
It may be seen that the difference 
between the two motivation condi- 
tions becomes greatest beyond 25 pre- 


rest trials. Further confirmation was 
gotten by an analysis of variance of 
the score differences which are pre- 
sented in Table IV. The F-ratios for 
both motivation and amount of pre- 
rest practice were found to be signifi- 
cant beyond the .05 level of confidence 
regardless of the method of measuring 
sl pr. 


SUMMARY 


The alphabet printing task was ad- 
ministered to 538 Ss under two condi- 
tions of motivation, low and high. 
Under each motivation condition a 
massed control group practiced under 
conditions of maximal inhibition, a 
spaced control group practiced under 
conditions of low inhibition, and nine 
experimental groups worked under 
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conditions of massed practice except 
for the introduction of a 10-min. rest 
after varying amounts of practice. 
The results and principal conclusions 
are as follows: 


1. Spaced practice was found to be 
definitely superior to massed practice. 

2. High motivation resulted in per- 
formance which was significantly sup- 
erior to that with low motivation, the 
difference becoming progressively 
greater as practice continued. 

3. Reactive inhibition (Jpg) was 
built up extremely rapidly with both 
high and low motivation. 

4. Less Jz was present late in learn- 
ing with low motivation than with 
high motivation. 

5. The decrease in reminiscence 
which usually occurs late in learning 
was clearly shown with low motiva- 
tion, but was less marked with high 
motivation. 

6. No conditioned inhibition (s/,p) 
was present very early in learning 
under either motivation condition, 
and its initial appearance occurred 
later with high motivation than with 
low motivation. 

7. Late in learning the amount and 
rate of development of s/z were less 
with high motivation than with low 
motivation. 

8. The results of this experiment 
are in substantial agreement with the 
hypothesis that high motivation 
should raise the critical (tolerable) 
level of reactive inhibition necessary 
to produce the automatic resting 
response. 


(Manuscript received 
September 21, 1950) 
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THE ACQUIRED DISTINCTIVENESS OF CUES: THE ROLE OF 
DISCRIMINATIVE VERBAL RESPONSES IN 
FACILITATING THE ACQUISITION OF 
DISCRIMINATIVE MOTOR RESPONSES 


BY IRMA L. ROSSMAN AND ALBERT E. GOSS! 


The University of Massachusetts 


Miller (24), extending his earlier 
work with Dollard (25), has proposed 
that: 


According to stimulus-response theory, learn- 
ing to respond with highly distinctive names to 
similar stimulus situations should tend to lessen 
the generalization of other responses from one of 
these situations to another since the stimuli pro- 
duced by responding with the distinctive name 
will tend to increase the differences in the stimu- 
lus patterns of the two situations. Increased 
differentiation based on this mechanism has 
been called acquired distinctiveness of cues. 

On the other hand, if the individual learns to 
respond to two quite different situations with 
the same verbal response, the stimuli produced 
by this response will be a common element 
mediating an increased amount of generalization 
from one situation to the other. This has been 
called acquired equivalence of cues, or secondary 
generalization (24, p. 174). 


Although experimental demonstra- 
tions of semantic generalization (4, 5, 
7) and the studies of Shipley (30) and 
Lumsdaine (22) are indirectly rele- 
vant to the above analysis of the ac- 
quired equivalence of cues, in only 
one study (2) were the problem and 
experimental design directly formu- 
lated in Miller’s terms. In this ex- 
periment Birge found that generaliza- 
tion or positive transfer of a manipula- 
tive response from one stimulus to 
another stimulus occurred more read- 
ily when young children had previ- 
ously learned to give the same non- 


1A more complete report may be found in 
(27). The first author was responsible for col- 
lecting and processing the data while the second 
author is in part responsible for the manuscript 
in its present form. Parts of these data were 
presented at the 1950 meetings of the American 
Psychological Association. 


sense syllable response to the two 
stimuli for the manipulative response. 
These findings, supplemented by data 
which indicated that overt verbaliza- 
tion facilitated positive transfer, were 
consistent with expectations based on 
Miller’s theoretical analysis. 
Lawrence (19,20) and_ subse- 
quently Jaynes (18) have reported 
data for discrimination learning in the 
rat which are consistent with predic- 
tions derived from Miller’s analysis 
of the acquired distinctiveness of cues. 
However, at the time the present study 
was initiated there appeared to be no 
studies of acquired distinctiveness 
with human Ss which could have 
been considered directly relevant in- 
vestigations of Miller’s hypothesis. 
Accordingly, it was a major purpose 
of this study to investigate the hy- 
pothesis that the acquisition of different 
verbal responses (names) to similar ex- 
ternal stimuli will facilitate the subse- 
quent acquisition of discriminative motor 
responses to the same external stimuli. 


Subsequent to the completion of the present 
study, Gagné and Baker (8) have reported that 
the amount of positive transfer to a motor dis- 
crimination task involving similar color-position 
cue combinations is a direct function of the 
amount of prior experience in distinguishing 
among these cues by means of different letter 
labels. Although not explicitly formulated in 
Miller’s terms these results are in accordance 
with his analysis of acquired distinctiveness. 
In a later study Baker and Wylie (1) required 
three groups to learn a motor discrimination 
after the groups had had 0, 8, or 24 rehearsals in 
associating the names for the correct motor re- 
sponses with the names for the color-position 
cues for the motor responses. While eight re- 
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hearsal trials did not lead to significant positive 
transfer, 24 of such trials resulted in significant 
transfer effects. If the assumption that the dis- 
tinctive verbal responses associated with the 
names for the color-position cues generalized to 
the actual cues during the motor discrimination 
trials is granted, these results are also consistent 
with the notion of the acquired distinctiveness of 
cues. 


In the present investigation trans- 
lation of the above hypothesis into 
experimental procedures involved 
comparison of the rates of acquisition 
of a motor discrimination by a group 
that had previously learned a verbal 
discrimination to a rigorous criterion, 
and groups that had been given a very 
small number of verbal learning trials. 

Because the present study was con- 
sidered to be preliminary to a series of 
studies stemming from Miller’s analy- 
sis of the functional role of verbal 
cues two subsidiary and an additional 
major hypothesis were explored. In 
order to test one subsidiary hypothe- 
sis, that overt verbalization during 


motor learning will lead to more rapid 
learning than covert verbalization, as 
suggested by Birge’s findings of the 
facilitative effects of overt verbaliza- 
tion, two groups which had learned 


the verbal discrimination to a cri- 
terion were used with different in- 
structions during the motor learning. 

Work reported by Gibson and 
Gagné and Foster as well as Miller’s 
treatment of acquired distinctiveness 
of cues provided the theoretical 
background for a second subsidiary 
question. Gibson predicted (11) and 
subsequently verified (13) the ex- 
istence of an initial increase and 
subsequent decrease in intra-list gen- 
eralization during the course of paired- 
associate learning. In a subsequent 
investigation of transfer from practice 
on components in a discriminative 
motor task Gagné and Foster (9) also 
found evidence which was consis tent 
with Gibson’s hypothesis of an up- 
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ward-downward trend in generaliza- 
tion. It is an implication of Miller’s 
approach that increased generaliza- 
tion of verbal responses to stimuli 
should increase the difficulty of learn- 
ing motor discrimination to the same 
stimuli. These considerations led to 
the hypothesis that, due to increased 
generalization of verbal responses, a 
small number of verbal discrimination 
trials will lead to less rapid learning of 
a motor discrimination than either one 
or many verbal discrimination trials. 
Two groups with a small number of 
verbal learning trials, specifically one 
and four trials, were used to investi- 
gate this hypothesis. 

Within the framework of stimulus- 
response language Miller has inter- 
preted the Freudian concept of primal 
repression as a decrement in the 
strength of verbal responses resulting 
from conflict between the verbal re- 
sponses and anxiety-motivated re- 
sponses of avoiding overt verbaliza- 
tion.? In the specific case of the 
mechanisms of acquired distinctive- 
ness or equivalence of cues, the con- 
flict would be between verbal re- 
sponses which have functional sig- 
nificance in discriminative or general- 
izing behavior and anxiety-motivated 
avoidance’ responses. Behaviorally 
this conflict would be expressed di- 
rectly as a decrement in the measured 
strength of verbal response. It would 
also be reflected indirectly as a decre- 
ment in the strength of discriminative 
or generalizing behavior which was 
functionally dependent on the verbal 
response-produced stimuli. It is the 
latter or indirect consequence of 


2 With the exception that covert or internal 
responses (anxiety) and anxiety-motivated overt 
responses have been distinguished, this concep- 
tion of repression is essentially similar to Sears’ 
(28) stimulus-response analysis of primal re- 
pression. Previous experimental work on re- 
pression has been summarized (14, 28, 29, 32). 
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repression which led to a second major 
hypothesis, that repression of the ver- 
bal responses mediating acquired dis- 
tinctiveness of cues will lower the rate 
of acquisition of discriminative motor 
responses. 


APPARATUS AND STIMULUS 
MATERIALS 


A pparatus.—The stimulus materials were pre- 
sented in the exposure slot of a memory drum at 
the rate of one every 2 sec. The interval be- 
tween trials was 8 sec. Throughout the training 
period E was seated behind the drum out of the 
direct view of S. Additional apparatus con- 
sisted of a buzzer, a shocking device, and a 
motor-response device. The shocking device 
was an inductorium with a 1.5-v. dry cell cur- 
rent source. Shock was administered to S 
through a small wrist band with metal contacts 
which served as electrodes on the inside and out- 
side surfaces of the wrist. The motor response 
device, a 6 X 6 X 1 in. box, was constructed to 
permit the movement of a 3} in. lever along 5} 
in. slots which started from a point near the 
middle of one side of the top surface and ex- 
tended diagonally forward toward the right and 
left corners of the opposite side. Movement of 
the lever to the end of the diagonal slots closed a 
circuit which in turn lighted one or the other of 
two lights arranged behind the drum to inform 
E of the direction of S’s response. The response 
device was placed in front of the right side of the 
shield of the drum; movement of the lever was 
diagonally left or right toward the shield. 

Stimulus materials—Lists of 16 and 12 
paired-associate units and a list of 12 nonsense 
figures were employed as stimulus materials. 
The stimulus members of the 16-unit list con- 
sisted of eight of Gibson’s (12) 13 standard 
figures and the eight corresponding figures of 
first-degree similarity. The response members 
of the units were eight pairs of Hull’s (16) non- 
sense syllables of approximately equal (within 
+4 per cent) and low (mean = 21.2 per cent) 
association values. These syllable pairs were 
selected so that the muscular movements re- 
quired for saying one member of each pair would 
be antagonistic to the movements required for 
saying the other member. Specifically, the first 
and last sounds (consonants) of one member of 
each pair of syllables were stops while those of 
the other member wete continuants (21). The 
eight stop syllables were assigned to the stand- 
ard figures and the eight continuant syllables to 
the figures of first-degree similarity. Since the 
16-unit list proved to be very difficult to learn, 
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six of the original eight pairs of similar figures 
and their paired syllables were used to construct 
an easier 12-unit list. The same six pairs of 
figures without the paired syllables were then 
employed to make up a nonsense figure list. 

Both figure-syllable pairs and figures alone 
were presented in four sequences in which each 
pair or figure appeared in a different quarter of 
the list during each sequence. Pairs containing 
similar figures or similar figures alone did not 
follow one another. 

Since a maximum of eight paired-associate 
units could be presented with one rotation of the 
drum, it was necessary to divide the 16- and 
12-unit lists into equal halves. The equal 
halves of each of the four sequences were placed 
in adjacent columns on the tape. The S shifted 
the exposure slots on the face of the drum when 
such was required. This required from 5 to 10 
sec. 


EXPERIMENT I: Errects oF SHocK 


Purpose 


This preliminary experiment was 
designed to determine whether shock 
would decrease the amount of reten- 
tion of well-learned verbal responses. 
Demonstration of repression due to 
electric shock is an essential condition 
to the test of the hypothesis relating 
repression to the effects of acquired 
distinctiveness of cues. 


Procedure 


Acquisition.—Thirty-seven undergraduate 
students in psychology at the University of 
Massachusetts learned the 16-unit figure-syllable 
list to three consecutive errorless trials. Seven 
of these were eliminated for failure to reach the 
criterion in 45 trials. 

The 30 remaining Ss were divided into shock 
and no-shock groups of 15 Ss which were 
matched by pairing each shock S with an indi- 
vidual no-shock S to achieve equal mean trials 
to learn (31.5) and standard deviations (7.5) 
with maximal matching in terms of sex. Indi- 
vidual shock and no-shock Ss were paired by 
assigning each S who reached the criterion within 
+2 trials of a no-shock S, preferably of like sex, 
to the shock condition. The assignment was 
made as the S completed the third criterion 
trial thus permitting the introduction of the 
shock trial without lengthening the usual 8-sec. 
inter-trial interval. 

The Ss were given standard instructions for 
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paired-associate learning with the following 
specific variants: the syllables were anticipated 
by pronouncing them, S was instructed to guess 
when unsure, and S was instructed to pronounce 
the correct syllable when it appeared, if an in- 
correct anticipation had been made. 

Shock.—All Ss had had their pain or discom- 
fort thresholds for shock determined for their 
non-preferred hands before training. Those in 
the shock group were given one additional trial 
on the list after reaching the criterion, during 
which all anticipatory syllable responses whether 
correct or incorrect were followed by shock. If 
no response was made, the shock was adminis- 
tered after the syllable was exposed on the drum. 
The no-shock group received the same additional 
trial, but without shock. 

Retention.—Eight seconds after the additional 
trial, with or without shock, both groups were 
given two additional trials. At the end of the 
retention test the Ss of the shock group were 
given a short questionnaire designed to deter- 
mine their reactions to the shock and retention 
trials. 


Results 


In Table I are shown the results of 
Exp. I. On the post-criterial trial 
during which shock was administered 
to the shock group, fewer correct re- 
sponses were made. This difference 
is significant at better than the .01 
level (t = 4.23). On the first and 
second retention trials, the shock 
group likewise made fewer correct re- 
sponses than the no-shock group. 
However, the differences are not 
highly significant, even though con- 
sistent. For the first retention trial 
t is 2.40 (.05 > p > .02), for the sec- 
ond trial 1.78 (p > .10), and for both 
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trials combined 2.23 (.05 > p > .02). 
In all estimates of significance the 
correlation between matched scores 
was taken into account (df = 14). 

One S in the shock group failed to 
make any correct responses during 
the two retention trials. Elimination 
of this S and his paired S in the no- 
shock group increased all t-values 
somewhat and also increased the sig- 
nificance levels despite the loss of one 
degree of freedom. 

Analysis of the questionnaire data 
indicates that the shock Ss were irri- 
tated, puzzled, tense, and fearful, in 
that order of decreasing strength and 
frequency, during the shock trial. 
Tenseness and anxiety were the 
strongest reactions during the reten- 
tion trials, with some Ss reporting 
hesitancy to say the syllables. 


EXPERIMENT II: AcquirRED 
DIsTINCTIVENESS OF CUES 


Procedure 


Experimental design.—The conditions of this 
experiment are outlined in Table II. The five 
groups were included to permit experimental 
answers to the following problems: (a) the 
determination of the facilitating effects of dis- 
criminative verbal responses by comparison of 
Group E-I with control Groups C-I and C-II; 
(b) the determination of the relative effective- 
ness of overt and covert verbalization by com- 
parison of Groups E-I and E-III; (c) the deter- 
mination of the retarding effects of shock by 
comparison of Groups E-I and E-II; and (d) 
the determination of the role of intra-list general- 
ization by comparison of Groups C-I and C-II. 


TABLE I] 


Errect oF SHock ror Correct or INcorrEcT VERBAL RESPONSES 
ON FREQUENCY OF Correct ANTICIPATIONS 








Retention Trials 





Condition 








Shock 
No-shock 


[8 
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TABLE II 


Conpitions 1n Experiment II 








Verbal 
Learning 


Groups 
(N =15) 


Post-Learn- 
ing Training 


Motor 
Learning 





E-I Mastery* 
E-II 
E-III 


1 trial, 20 trials 
no shock 
1 trial, 
shock 

1 trial, 
no shock 


Mastery 20 trials 


Mastery 20 trials, 
overt ver- 
balization 
20 trials 
20 trials 


None 
None 


1 trial 


C-Il 4 trials 














* Defined as three successive errorless trials. 


Acquisition of verbal responses.—Sixty-three 
Ss to be assigned to Groups E-I, E-II, and E-III 
learned the 12-unit figure-syllable list to three 
consecutive errorless trials. Seventeen of these 
Ss failed to reach the criterion in 45 trials and 
were eliminated. Of the remaining 46, 45 were 
distributed into three equal groups according to 
the matching procedure employed in Exp. I, 
with the result that Groups E-I, E-II, and E-III 
required 24.0, 24.1, and 24.5 trials to learn with 
SD’s of 6.0, 6.0, and 5.5, respectively. An addi- 
tional 15 Ss were assigned to Group C-I and 
given one learning trial on the 12-unit list. 
Similarly, 15 Ss were assigned to Group C-II 
and given four learning trials. Assignment to 
these groups was on a matching basis to be de- 
scribed in the section on the motor task. 

Post-criterial trials —The Ss in Group E-III 
were given shocks on one post-criterial trial. 
The interval between this trial and the third 
criterial trial was the usual 8 sec. As in Exp. I, 
Groups E-I and E-III were given one post-cri- 
terial trial without shock. All Ss had been pre- 
viously tested to determine their pain or discom- 
fort thresholds to shock in the non-preferred 
hand. 

Acquisition of motor responses.—Approxi- 
mately 1 min. after completion of the verbal 
learning all groups practiced a motor discrimina- 
tion for 20 trials. Only the Ss of Group E-III 
were instructed to say the previously learned 
nonsense syllable response before or concomitant 
with the motor response. In the motor learning 
situation Ss learned to discriminate by means of 
left or right movements of the lever of the motor- 
response device, between pairs of similar non- 
sense figures. Specifically, they were to learn 
movement to the left as the correct response to 
each of the standard figures. Conversely, 
movement to the right was designated as the 
correct response to the corresponding figures of 
first-degree similarity. Knowledge of correct 
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responses was given by means of a buzzer which 
was sounded by E after S pushed the lever to 
the end of the correct slot. No correction of 
incorrect responses was permitted. 

The Ss in Groups C-I and C-II were paired 
with the Ss in Group E-I on the basis of the 
motor task. Each S of Group E-I was matched 
with Ss of Groups C-I and C-II who were within 
+0.50 mean correct responses for the first block 
of 4 trials. Whenever possible Ss were further 
matched on the basis of sex, and Ss were run 
under the C-I and C-II conditions until the mean 
number of correct responses for the first four- 
trial blocks for the three groups were equal. 

The instructions for the motor task merely 
told S to move the lever to the left or right when 
a figure appeared in the stimulus aperture and to 
return the lever to neutral after each response. 
They also informed S that a buzzer would sound 
if the direction of movement was correct, and 
that no correction of errors could be made. The 
Ss were not specifically informed that the figures 
to be seen were those to which nonsense syllable 
responses had previously been learned, except 
in the case of Group E-III. These Ss were in- 
structed to make the previously learned non- 
sense-syllable response to a given figure before 
or concomitant with making the motor response 
to that figure. 

Questionnaire.—All Ss were given question- 
naires after the motor learning. One question- 
naire, given to all groups, was designed to elicit 
additional information about the method used 
in the motor discrimination. The Ss of Group 
E-II filled out a second questionnaire on their 
reactions to the shock trial. 


Results 


With respect to the effect of shock 
on the verbal responses during the 
post-criterial trial, Table III clearly 
reveals the inhibitory effect of shock. 
The differences between the shock 
(E-II) and no-shock conditions (E-I, 


TABLE Ill 


Errect or SHock ON FREQUENCY OF 
Correct RESPONSES DURING THE 
First Post—CriteriaL TRIAL 








Mean 


Condition | 





No-shock 
Shock 
No-shock 
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TABLE IV 


Mean Correct RESPONSES DURING 
Motor LEARNING 








Trials 1-4 Trials 5-20 


Group 

















E-III) are significant beyond the .01 
level. The difference between Groups 
E-I and E-III is not significant. 
Group C-II made an average of 2.73 
correct anticipations on the fourth 
verbal learning trial. Unfortunately, 
a mistake in the scoring procedure pre- 
cluded the determination of intra-list 
generalization during this trial. 

Motor learning data are presented 
in Table IV in terms of the means and 
standard deviations of correct re- 
sponses made in Trials 1-4, the match- 
ing trials, and Trials 5-20, the com- 
parison trials. In accordance with 
the pre-experimental criterion, Groups 
E-I, C-I, and C-II have equal first 
means for Trials 1-4. The mean 
values of 6.02 for these groups do not 
differ significantly from the somewhat 
lower means of Groups E-II and 
E-III. 

Comparison of the mean for Trials 
5-20 of Group E-I with those of 
Groups C-I and C-II yielded ?#’s of 
related measures of 3.28 (p < .O1) 
and 3.32 (p< .01), respectively.’ 
For corresponding differences between 
Groups E-I and E-II and Groups E-I 
and E-III, t’s of 1.68 (.20 > p > .10) 
and 2.40 (.05 > p > .02) were ob- 


3 Due to unconventional matching procedure 
it was considered inadvisable to utilize the more 
sophisticated techniques for group comparison 
in transfer studies advocated by Gagné, Foster, 
and Crowley (10). 


tained. The difference between 
Groups C-I and C-II did not approach 
significance (t = 0.56). All p-values 
are based upon 14 df. 

Learning curves on the motor dis- 
crimination are shown in Fig. 1. 
The measure of learning is the mean 
correct responses for successive four- 
trial units. Although Group E-I had 
reached a mean score of only 75 per 
cent correct responses by Trials 17-20, 
it is apparent that this group was 
consistently superior to the other 
groups. Group E-II was learning 
the motor discrimination at a some- 
what lower rate than E-I. However, 
Group E-II was in turn performing at 
a higher level than the remaining 
groups during the last eight trials. 
The discrimination problem, as evi- 
denced by the very slow rise of the 
curves, was much more difficult for 
Groups C-I, C-II, and E-III. How- 
ever for trials 13-20 the performance 
of E-III was superior to that of the 
control groups. 

Analysis of the questionnaire data 
concerning reactions to shock for 
Group E-II indicated that Ss of that 
group felt puzzled, irritated, like 
hesitating to say the syllables, and 
tense, in order of decreasing strength 
and frequency. The groups with 
verbal learning experience used the 
syllables as facilitating devices to 


oye 
* 








TRIALS 


Fic. 1. Learning curves for mean correct 
responses for successive four-trial blocks 
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some degree during the motor learn- 
ing. However, they reported that 
looking for identifying parts was more 
frequently employed, and as sub- 
jectively evaluated, more helpful. 
The Ss of Groups C-I and C-II also 
reported that they used the identifying 
parts technique. There appeared to 
be only slight differences among the 
five groups in the frequency of use of 
this technique. 


Discussion AND CONCLUSIONS 


Group E-I, which had mastered the 
verbal discrimination, learned the 
motor task more rapidly than Groups 
C-I and C-II, which had had only a 
very small number of verbal learning 
trials. Further, questionnaire results 


revealed that Ss of Group E-I had 
made covert nonsense-syllable _re- 
sponses during the course of motor 
learning and that, evaluated subjec- 
tively, these covert verbal responses 
had facilitated the acquisition of the 


motor discrimination. These results, 
which are consistent with those re- 
ported by Gagné and Baker (8), sup- 
port the hypothesis that learning dis- 
criminative verbal responses facili- 
tates the learning of discriminative 
motor responses to the same stimuli. 
However, before the superiority of 
Group E-I can be interpreted as an 
unequivocal confirmation of the spe- 
cific hypothesis of transfer based upon 
the mechanism of the acquired dis- 
tinctiveness of cues, the significance 
of the following conditions must be 
investigated. 


First, since the questionnaire results revealed 
that “looking for and identifying parts” was the 
most frequently used, and, as subjectively 
evaluated, most facilitative technique for all 
groups, it is possible that the Ss of Group E-I 
learned the motor discrimination more rapidly 
because of their prior experience in using thi, 
technique. On closer analysis it would appear 
that the Ss could have interpreted the phrase 
“looking for and identifying parts” as referring 
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to any one or all of three distinct techniques: 
(a) learning to select parts of pairs of similar 
figures which were more discriminable than the 
whole figure;* (b) learning to select parts of the 
figure pairs which may or may not have been 
more discriminable, and then learning to make 
the nonsense-syllable response to the selected 
parts; and (c) learning to select parts which may 
or may not have been more discriminable and 
then identifying the parts with familiar rather 
than nonsense-syllable names. As a _ conse- 
quence, the above information is of little value 
as the basis of alternative explanations of the 
results of the comparisons of Groups E-I with 
C-I and E-I with C-II. It should be noted, 
however, that both the second and third of the 
possible interpretations involve variations of 
the mechanism of acquired distinctiveness. 

Second, during the learning of the verbal dis- 
crimination Ss of Group E-I may have also 
learned that there were pairs of similar figures. 
This knowledge, in the form “go left for one 
member of the pair and go right for the other 
member,” may have facilitated motor learning 
for this group. 

Finally, it might be hypothesized that Group 
E-I was aided by factors such as warm-up and 
knowledge of the orders of the figures. 


It is suggested that until future 
experimentation isolates and indicates 
the functional relevance of the above 
conditions, the comparisons involving 
Group E-I are best interpreted as 
supporting the first major hypothesis 
that the acquisition of discriminative 
verbal responses will facilitate the 
acquisition of discriminative motor 
responses to the same external stimuli. 

Comparison of mean correct re- 
sponses and of learning curves for 
Groups E-I and E-III, the overt 
verbalization group, revealed that 
contrary to the first subsidiary hy- 
pothesis of the superiority of overt 
verbalization, the former group 
learned the motor task more rapidly. 
Reports volunteered by Ss at the con- 
clusion of the motor discrimination 


‘The nonsense figure stimuli were not “per- 
ceptually simple” cues. Hence it is possible 
that orientation behavior (31) or “looking for 
and identifying” responses were relevant to the 
early stages of both verbal and motor discrimina- 
tion learning. 
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provide the basis for a tentative ex- 
planation of the unexpectedly low 
performance of Group E-III. In 
general, Ss found that they did not 
have enough time to think of and to 
make the verbal response and to make 
the motor response during the 2-sec. 
response interval. The instructions 
to say the syllables aloud apparently 
introduced an  interfgrence factor 
which retarded the acquisition of the 
motor discrimination. 

The second of the subsidiary hy- 
potheses, that Group C-II, which had 
four verbal learning trials, would learn 
the motor discrimination less rapidly 
than Group C-I, the one-trial group, 
was based on Gibson’s (13) findings 
that intra-list generalization increased 
to a maximum during the third and 
fourth tenths of total learning trials. 
Comparison of total scores revealed 
a slight but not statistically signifi- 
cant difference in favor of Group C-I. 
In part because of the previously 
noted mistake in scoring which pre- 
cluded assessing the degree of intra- 
list generalization for Group C-II, 
the data do not permit the isolation 
of the factors which may have ac- 
counted for the failure to obtain a 
significantly retarded performance for 
Group C-II. 

Under the shock-trial conditions of 
Exp. I, according to principles ad- 
vanced by Mowrer (26), Miller (23), 
and others (3,6), internal and overt 
responses to shock should have been 
conditioned to the cue pattern of 
figure plus verbal response-produced 
stimulation. This conditioned form 
of the internal response to pain has 
been termed anxiety or fear. On the 
subsequent retention trials, presenta- 
tion of the figure alone should have 
(a) by the principle of generalization 
(17), elicited the internal (anxiety, 
fear) and overt responses previously 
aroused by shock, and (b) reinstated 
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the cue-pattern of figure plus response- 
produced stimulation most strongly 
conditioned to anxiety. However, 
the increase in anxiety occasioned by 
reinstating the cue-pattern could have 
been prevented by failing to respond. 
Since shock was an immediate con- 
sequence of both correct and incorrect 
responses but was given only after 
exposure of the correct syllable in the 
case of failure to anticipate, avoidance 
of the verbal responses would have 
had the two-fold effect of preventing 
an increase in anxiety and of post- 
poning shock. It was anticipated, 
therefore, that as a result of the pres- 
ence of conflicting anxiety-motivated 
avoidance responses the strength of 
the verbal responses on the two post- 
shock retention trials would be re- 
duced. This expectation was tenta- 
tively confirmed by small decrements, 
significant at better than the .05 
level, in the first and combined re- 
tention test scores of the shock group. 

These results provided the basis 
for the second major hypothesis of 
Exp. II, that due to repression of the 
discriminative verbal responses the 
shock group (E-II) would learn the 
motor task less rapidly than Group 
E-I. While consistent with this hy- 
pothesis, the observed superiority of 
Group E-I in mean correct responses 
was not statistically significant. 

The failure to obtain a significant 


5 Farber (6) has hypothesized that responses 
which have been followed by shock will be 
strengthened relative to other responses under 
conditions of unavoidable punishment and/or 
high initial strength of the punished response. 
Further, Gwinn (15) has demonstrated that if 
the punished response is compatible with the 
escape response, punishment will, at least initi- 
ally, strengthen the punished response. It 
should be noted, therefore, that the predicted 
decrement in verbal response strength is re- 
stricted to the one shock trial of the present 
experiments. It is possible that continued 
shock trials might lead ultimately to increased 
verbal_response strength. 
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difference between Groups E-I and 
E-II may have been due to failure to 
achieve a fairly large and stable re- 
pression with only one shock trial. 
In this case, the present experiment 
did not realize the conditions neces- 
sary for an adequate test of Miller’s 
repression hypothesis. Further, 
while it is doubtful that shortening 
the figure-syllable list by four units 
would have significantly reduced the 
percentage decrement in response 
strength attributable to one shock 
trial, this possibility is not demon- 
strably irrelevant. 


SUMMARY 


In this study several hypotheses 
stemming from the concept of the 
acquired distinctiveness of cues were 
investigated. Stimulus materials 


were a 12-unit figure-syllable paired- 
associate list and a list of the same 12 
figures alone, both of which were pre- 
sented at a 2-sec. rate on a memory 


drum. Additional apparatus con- 
sisted of motor response and shocking 
devices. 

Three matched groups of 15 Ss each 
were used to determine whether the 
acquisition of discriminative verbal 
responses facilitates the acquisition 
of discriminative motor responses to 
the same stimuli. The results indi- 
cated that a group (E-I) which had 
learned to mastery the discrimination 
between pairs of similar figures by 
means of nonsense syllable responses 
acquired discriminative motor re- 
sponses to the same figure stimuli 
significantly more rapidly than groups 
(C-I and C-II) which had been given 
only one and four verbal learning trials 
respectively. 

The motor learning performance of 
Group E-I was superior, although not 
significantly so, to the performance of 
another group (E-III) which had also 
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learned the verbal discrimination to 
mastery and was then instructed to 
verbalize verbal responses overtly 
during the motor learning trials. The 
hypothesis that overt verbalization of 
discriminative verbal responses would 
have greater facilitative effects than 
covert verbalization was not sup- 
ported, but the short time (2 sec.) 
allowed for the verbal and motor 
responses may have been an import- 
ant factor in this result. 

Since the difference between Groups 
C-I and C-II was not significant, the 
hypothesis that, due to intra-list gen- 
eralization, the group with only one 
verbal learning trial (C-I) would learn 
more rapidly than the group with four 
verbal learning trials (C-II), was not 
supported. 

A preliminary investigation of the 
effects of shock for correct or incor- 
rect responses after mastery of a 
verbal discrimination suggested that 
shock occasioned a reduction in the 
immediate retention of the verbal 
responses. It was possible, therefore, 
to investigate the hypothesis that 
repression of the verbal responses 
mediating acquired distinctiveness of 
cues by means of electric shock would 
lower the rate of acquisition of dis- 
criminative motor responses. Al- 
though the direction of differences ob- 
tained was in the predicted direction, 
they were not statistically significant. 


(Manuscript received 
September 16, 1950) 
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ANXIETY AND STRENGTH OF THE UCS 
AS DETERMINERS OF THE AMOUNT 
OF EYELID CONDITIONING 
BY K. W. SPENCE AND JANET TAYLOR! 


State University of lowa 


In a recent experiment by one of 
the authors (9), an attempt was made 
to evaluate the hypothesis that the 
amount of conditioning exhibited in a 
defense conditioning situation is a 
direct function of the level of total 
effective drive. Following Hull (3), 
it was assumed that the value of drive 
(D) operating at the time of occur- 
rence of a conditioned response is 
determined not only by the relevant 
need (i.e., the one which is reduced by 
the response under consideration), 
but also by the aggregate strength of 
all other primary and secondary needs 
operative at the moment. The chief 


implications of this assumption are 
supported by concrete experimental 


evidence (1, 5, 10). 

On the assumption that variations 
in what is described psychiatrically 
as “manifest anxiety” reflect vari- 
ations in generalized drive levels, 
groups of anxious and non-anxious Ss 
chosen on the basis of extreme scores 
on a test of manifest anxiety,? were 
compared with respect to the develop- 
ment of the conditioned eyelid re- 
sponse. The results indicated that 
the anxious group was consistently 
superior in the amount of conditioning 
exhibited throughout the course of 
conditioning trials, the differences 


1 Now at Northwestern University. 

2? The manifest anxiety scale consisted of 65 
items from the Minnesota Multiphasic Personal- 
ity Inventory judged by clinicians to be indica- 
tive of manifest anxiety. Selection of Ss was 
made by taking the upper and lower 15 per cent 
of a distribution of scores of 352 students in 
introductory psychology. A more complete 
description of the test is available elsewhere (9). 


between the two groups being highly 
significant statistically (9). 

The present study represents a 
further investigation of the condition- 
ing of anxious and non-anxious Ss. 
The experiment was designed to pro- 
vide data on the effects of varying 
degrees of intensity of the uncondi- 
tioned stimulus (UCS) on the amount 
of conditioning exhibited by anxious 
and non-anxious Ss, chosen independ- 
ently in the same manner as the previ- 
ous study. 

A study by Passey (7) constitutes 
the only evidence now available in the 
literature concerning the effect of 
variation of the intensity of the air 
puff on the development of the condi- 
tioned eyelid response. Four groups 
of ten Ss were given 50 paired pres- 
entations of a tone and an air puff, 
each group receiving a different inten- 
sity of puff as the UCS. The results 
indicated that the amount of condi- 
tioning was a positive function of the 
intensity of the UCS. According to 
Passey’s analysis, the rate of acquisi- 
tion exhibited a direct relationship to 
the logarithm of the intensity. Un- 
fortunately, none of the groups ap- 
peared to have reached the limits of 
conditioning by the end of the training 
trials so that a comparison of the 
asymptotes of the groups was not 
possible. 

In an as yet unpublished study by 
the present writers (8) a comparison 
was made of conditioning with an air 
puff resulting from a 100-mm. fall of a 
column of mercury with a puff result- 
ing from a 30-mm. fall. The curve 
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for the group with the strong puff rose 
more rapidly to a higher asymptote 
than that for the weak puff group. 
The differences between the total 
number of CR’s for all trials were 
found to be significant between the 
.O1 and .05 levels of confidence. 


PROCEDURE 


Subjects.—A test of manifest anxiety was ad- 
ministered in group form to 574 students in a 
course in introductory psychology. The meas- 
ures, based on 50 items, ranged from a low 
anxiety score of zero to a high anxiety score of 
41%. The selection of Ss for the anxious and 
non-anxious groups was made by taking the in- 
dividuals who scored in the upper 20 per cent 
(score of 24 or above) and the lower 20 per cent 
(score of 9 or below) of the distribution. One 
hundred and twenty-one Ss were run in the 
experiment, of which eight were lost either be- 
cause of apparatus difficulties or E’s error, and 
two because of excessive spontaneous blinking. 
Of the remaining 111 Ss, 11 were discarded, five 
from the anxious groups and six from the non- 
anxious groups because they did not meet certain 
latency criteria that were established as a means 
of eliminating Ss who were believed to be making 
voluntary eyeblinks to the conditioned stimu- 
lus. Quite in contrast to the usual situation that 
has held in conditioning studies in the lowa 
Laboratory, it was found that a high percentage 
(82 per cent) of the Ss recognized and were able 
to state the purpose of the experiment, i.e., as 
concerned with eyelid conditioning. This con- 
trasts very sharply with the percentage of 23 
found by Taylor (9) and approximately 20 per 
cent found in another study (8). 

The reason for the unusual degree of sophisti- 
cation on the part of the Ss was easy to trace. 
It resulted from a lecture given just prior to the 
start of the experiment on the CR in which 
details of the method of recording eyelid CR’s 
were described by slides. From verbal reports 
of the Ss it soon became apparent that many of 
them sought to cooperate and facilitate the con- 
ditioning by voluntarily closing their eye to the 
conditioned light signal and keeping it closed 
until after the occurrence of the UCS. The 
pattern of such responses was quite character- 
istic in form and could be easily differentiated. 
They also tended to be extremely short in 
latency. In order to eliminate such Ss by an 
objective criterion the latencies of all Ss were 


On the basis of an item analysis of the 65 
items in the original scale, the 50 most discrimin- 
ating items were selected for the new scale. 
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examined and the percentage of CR’s less than 
300 msec. in latency was computed for each S. 
The median of this value was found to be 14.3 
per cent. There was a definite bimodality to 
distribution with the distribution dropping to 
zero frequency for percentage scores between 
40 per cent and 50 per cent with a second mode 
appearing above this value. Accordingly, it was 
decided to exclude the data of all Ss who gave 
more than 50 per cent CR’s with latencies less 
than 300 msec. 

Fifty of the 100 remaining Ss were from the 
high (anxious) end of the distribution of anxiety 
scores and 50 from the low (non-anxious) end. 
Half of each of these two groups was given a 
relatively strong puff of air and half a weak puff. 
There were thus four sub-groups of 25 Ss each, 
designated as follows: Anxious strong, anxious 
weak, non-anxious strong and non-anxious weak. 

Apparatus and method of recording.—The S 
was seated in a dental chair in a semi-darkened 
room adjoining that in which the recording ap- 
paratus and stimulus controls were located. 
Fixed in a reclining position in the chair with his 
head braced against a head rest, S was instructed 
to blink on receiving a ready signal, and then to 
fixate on a 6 cm. circular milk glass window or 
disc, placed at a distance of 132 cm. in front of 
him. The brightness of the milk glass disc be- 
tween trials was .05 apparent foot candles. 

The CS consisted in an increase in brightness 
of the disc to 1.51 apparent foot candles. The 
duration of the CS on each trial was 550 msec. 
The USC, a puff of air, followed the onset of the 
CS by 520 msec. and was delivered to the exter- 
nal canthus of the right eye. The air puff was 
produced by the fall of a column of mercury, 
magnetically released, ina manometer. The fall 
was 100 mm. for the strong UCS, and 30 mm. 
for the weak UCS. 

The onset of the CS was controlied by the 
closing of a switch on an electronic timer. 
The latter controlled the duration of the CS and 
the onset of the UCS at the appropriate time 
interval. The onset of the stimuli was recorded 
on a Brush polygraph by means of a Brush 
BL-902 pen motor. 

The movement of the eyelid was recorded by a 
combination of mechanical and electrical means 
that provided a reasonably linear record of the 
position of the eyelid at all times. The method 
is being described in detail for publication else- 
where (4). Briefly, it involved the use of (a) 
a very ‘ightweight microtorque potentiometer, 
(b) a fixed frequency oscillator, (c) an audio- 
frequency amplifier, (d) a full wave rectifier for 
rectifying the amplified signal, (e) a Brush 
BL-902 pen recorder and (f) a Brush polygraph. 

The microtorque potentiometer was mounted 
on an adjustable head band just above S’s eye. 
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A very light lever was attached at right angles to 
the variable (rotating) arm of the potentiometer 
and a mechanical linkage was accomplished be- 
tween it and a tiny plastic lever fastened to the 
upper eyelid by adhesive tape. A small restoring 
spring was added to the potentiometer arm thus 
permitting it to follow the eyelid’s motion. The 
system was very light and involved a minimum 
of work. Movement of the eyelid, with its 
accompanying rotation of the potentiometer arm, 
altered the voltage from an oscillator source ap- 
plied to an amplifier. The output of the ampli- 
fier was then rectified and made to operate the 
Brush pen motor. 

Conditioning procedure.—Each S first received 
three presentations of the light alone in order to 
check for any initial conditioned response tend- 
ency to the light. A single puff of air, unaccom- 
panied by the light, was then administered and 
spontaneous blinking recorded for 40 sec. fol- 
lowing onset of the puff. Immediately following 
the preliminary trials, all Ss were given 100 
paired presentations of the light and air puff. 
Inter-trial intervals of 15, 20, or 25 sec., averag- 
ing 20 sec. and arranged according to a fixed 
schedule, were used. At the end of the session, 
all Ss were questioned as to their understanding 
of the purpose of the experiment and warned not 
to discuss the experiment with other members of 
the class. 


REsuLTs AND Discussion 


Anxiety and conditioning.—Curves 
of acquisition of the CR, based on the 
median per cent of anticipatory re- 
sponses occurring in successive blocks 
of ten trials for the four subgroups are 


presented in Fig. 1. An inspection of 
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Fic. 1. Acquisition curves showing the med- 
ian number of CR’s for the four subgroups in 
successive blocks of ten trials 
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the curves reveals that the anxious 
groups (anxious-strong and anxious- 
weak) tend consistently to be above 
the non-anxious groups. 

The total amount of conditioning 
in 100 trials exhibited by the four 
groups, shown in Table I, further indi- 
cates the superiority of the anxious 
groups. The results of an analysis of 
variance, used to determine the sig- 
nificance of the obtained differences, 
are summarized in Table II. The F 
for the anxious—non-anxious condition 
was found to be 8.59, which is sig- 
nificant at above the .01 level of con- 
fidence. In order to test the signifi- 
cance of the difference between the 
corresponding anxious and non-anxi- 
ous groups, the t-test was employed. 
For the difference between the anxi- 
ous-strong and non-anxious strong 
groups the ¢ was 1.98, and that be- 
tween the anxious and non-anxious 
weak groups was 2.14, both of which 


TABLE II 


ANALYSIS OF VARIANCE FOR TRIALS 








Mean 


Square 


4719.69 
1332.25 
8.41 
549.57 


Source df 





Anxiety 
Strength of puff 
AXS 

Within groups 
Total 











* OS level, F = 3.94; at .01 level. F = 6.90. 
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are significant at the .05 level of 
confidence.‘ 


The superior conditioning of the anxious 
groups confirms the results of the previous 
study (9). Although the strength of puff used 
in the original investigation was somewhat less 
intense than that employed with the present 
strong puff groups, it is possible to make a rough 
comparison of the conditioning of the strong puff 
groups in the present study and those in the 
previous investigation. 

A comparison of the two sets of non-anxious 
Ss reveals that the present non-anxious group 
exhibited a greater amount of total conditioning 
(39.7 vs. 27.9 per cent CR’s in 80 trials). While 
it is reasonable to expect the non-anxious strong 
Ss to exhibit a slightly higher amount of condi- 
tioning because of the more intense air puff, it is 
possible to suggest a further factor that may have 
contributed to such a difference. As was men- 
tioned previously, a large number of Ss (82 per 
cent) in the present investigation were aware of 
the purpose of the experiment and appeared to 
adopt a positive set that facilitated the condi- 
tioning. Although an attempt was made to 
eliminate Ss who exhibited voluntary responses 
to the light, it is possible that the presence of this 
facilitatory set in many of the Ss resulted in a 
higher amount of conditioning than would be 
found in more naive Ss. Ample experimental 
evidence indicates the influence that set may 
have on the development of the CR and tends to 
support this suggestion (2, 6). 

With respect to the anxious Ss, however, the 
anxious-strong group of the present investigation 
exhibited considerably less conditioning than the 
anxious Ss in the first study, in spite of the 
greater intensity of puff (39.96 vs. 59.60 per cent 
CR’s in 80 trials). However, it should be noted 
that the criterion for inclusion in the anxiety 
group in the original experiment was a percentile 
rank of 89 or above on the anxiety scale, as com- 
pared to a percentile rank of only 80 in the 
present investigation. In order to check the 
possibility that the discrepancy in the range of 
anxiety scores in the two studies may have been 
responsible for the differences in the condition- 
ing of the two sets of anxiety Ss, the conditioning 
scores for only those anxious-strong Ss with 
percentile ranks of 89 or better on the 
anxiety scale were computed. The amount of 
conditioning exhibited by this restricted anxiety- 
strong group was quite comparable to that of the 
anxious Ss in the original study, being 60.3 per 
cent for 80 trials. 


4 The results were also analyzed by the Mann- 
Whitney U-test. The differences between the 
anxious and non-anxious groups were found to be 
between the .02 and .05 levels of significance. 
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Strength of UCS and conditioning.— 
It is clear from an examination of the 
acquisition curves presented in Fig. 1 
that the anxious and non-anxious 
strong puff groups tended to condition 
somewhat better then did the cor- 
responding weak puff groups, par- 
ticularly in the latter half of the train- 
ing period. This superiority of the 
strong Ss is further revealed in the 
total amount of conditioning of each 
group, shown in Table I. However, 
the F obtained from the analysis of 
variance was 2.42, a value which does 
not reach significance at the .05 level 
of confidence. 

The failure to find a significant 
difference between the strong and 
weak puff conditions contrasts sharply 
with another study (8) the writers 
have conducted in which the groups 
were selected at random from an intro- 
ductory psychology class rather than 
by means of anxiety scores. In this 
investigation, in which the strengths 


of puff employed were the same as 
those utilized in the present experi- 
ment, the strong puff group exhibited 
markedly superior conditioning, es- 


pecially in the first 50 trials. The 
results of Passey’s study (7) also 
clearly demonstrated that strength 
of the unconditioned puff stimulus is 
a factor determining amount of con- 
ditioning. 

One possible explanation of the 
failure to find significant differences 
in the present experiment is that this 
variable (strength of UCS) is not 
effective in the case of Ss at the ex- 
treme ends of the anxiety scale. In 
other words, the differences in strength 
of the UCS produce no marked differ- 
ence in the level of drive (anxiety) of 
extremely anxious and extremely non- 
anxious subjects. 

Another difference between the 
present study and that concerned with 
the effects of puff strength (8) was the 
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factor, already mentioned, of degree 
of sophistication of the Ss. The 
presence of a large number of sophisti- 
cated Ss in the present experiments 
may have obscured the differential 
effects of the strength of puff on the 
development of the CR. 


THEORETICAL INTERPRETATION 


One possible interpretation of the 
difference in amount of conditioning 
exhibited by the different groups of 
Ss is that these groups, in Hull’s 
terms, develop habit strength (spr) 
at different rates and reach a different 
level of maximum habit strength. 
This is the suggestion that Passey (7) 
made concerning the effects of in- 
creased intensities of puff that he ob- 
tained. It should be noted, however, 
that according to Hull’s theoretical 
formulation, the response measure is a 
function of both habit and drive 
strength: R = f(H1 XD). Thus, it 


is theoretically possible for the habit 
growth curves of the different groups 
of Ss to be the same and yet have their 
response curves increase at different 
rates to different asymptotes because 
of different drive (D) values. 


Whether the different response 
measures obtained for the different 
groups in Passey’s experiment and in 
the present study reflect differences 
in (a) habit strength, (b) drive 
strength or (c) both of these inter- 
vening variables, cannot be answered 
by the type of experimental design 
employed. In a further experiment 
conducted by the present writers (8) 
a specific attempt is being made to 
obtain evidence as to whcther differ- 
ences in habit strength are produced 
by different intensities of the UCS. 


SUMMARY 


The experiment was designed to 
provide data on the effects of strong 
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and weak intensities of the UCS on 
the amount of conditioning exhibited 
by anxious and non-anxious Ss, se- 
lected independently of the experi- 
mental situation. 

Two groups of 50 Ss were chosen on 
the basis of extreme scores made on a 
test of manifest anxiety. The groups 
were run in a conditioned eyelid situ- 
ation. Half of each group received a 
relatively strong puff of air as the 
UCS, the remaining half of each group 
a relatively weak puff. 

Analysis of the results indicated 
that the anxious subgroups were con- 
sistently superior to the corresponding 
non-anxious groups in amount of 
conditioning throughout the course of 
the conditioning trials, the difference 
between the groups being significant 
statistically. Comparison of the 
strong- and weak-puff groups revealed 
that although the strong group ex- 
hibited a larger number of CR’s 
throughout the conditioning, the dif- 
ferences from the corresponding weak 
groups were not significant. The 
results are discussed in terms of Hull’s 
constructs of habit (sHx) and drive 
strength (D). 


(Manuscript received October 2, 1950) 
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MEMORY FOR PLEASANT AND UNPLEASANT 
EXPERIENCES: SOME METHODOLOGICAL 
CONSIDERATIONS 


BY RALPH H. TURNER AND JOHN A. BARLOW 
Oberlin College 


The early work in this field was 
characterized by findings which con- 
tained so many inconsistencies that it 
became apparent that many of the 
factors relevant to the recall of pleas- 
ant and unpleasant experiences were 
not being adequately controlled. 
More recent experimentation has had 
the merit of incorporating the findings 
of previous work, with the result that 
today’s findings contain fewer ambigu- 
ities. In this paper, the authors 
present the results of an experiment 
which points to two further respects 
in which inadequate methodology has 
characterized work in this field. 


There are many respects in which experi- 
ments in this area have been inadequately con- 
trolled. (a) Early investigators attempted to 
use the very unreliable questionnaire method in 
which Ss were asked whether they recalled pleas- 
ant or unpleasant experiences more readily. 
(b) In some instances as few as 15 Ss have been 
used in the experimental work. (c) Words have 
often been selected by £ as pleasant and unpleas- 
ant without determining whether the Ss per- 
ceived them to be pleasant and unpleasant as 
assumed. (d) Some procedures have involved 
presenting Ss with material to be learned, the Ss 
later being asked to designate which material 
was pleasant and which was unpleasant. Such 
a procedure runs the risk that S$ will tend to 
designate as pleasant the material which was 
more readily learned. (e) Artificial laboratory 
situations, involving pleasant or unpleasant 
odors or colors or other sensations, have been 
used in inves igating the retention of P and U 
experiences. (f) Many Es who saw the inade- 
quacy of using pleasant or unpleasant odors or 
colors in artificial laboratory experiences never- 
theless used words derived from Thorndike’s 
word list or some other source. (g) Much work 
has confined itself to a simple rating of the re- 
called material as to pleasantness or unpleasant- 


ness without a rating of the intensity accompany- 
ing such feeling. (h) Many experiments, such 
as those of Kowalewski (6) and Wohlgemuth (13), 
have asked for the recall of events following a 
vacation period when the affective tone of recent 
experiences is more likely to be judged to be 
pleasant. (i) Some work has been done on the 
assumption that the recall of a larger number of 
P than U words is sufficient to justify the con- 
clusion that P experiences are remembered 
better than U experiences, but the work of 
Fligel (3) has shown the probability that more 
experiences judged to be P are available for recall. 
(j) Conclusions have often been drawn on the 
basis of one immediate recall, but Gilbert (4) has 
noted that delayed recall periods give opportu- 
nity for the relevant factors under investi- 
gation to operate and thus yield results more 
likely to correspond with the delayed recall 
found in everyday life situations. (k) Some 


Es have failed to note the fact that many 


experiences recalled during the second or 
third recall period were “new experiences” in 
the sense that they were not reported during the 
first recall period. This tendency to lump to- 
gether the newly reported with previously re- 
ported experiences has obscured consistencies in 
the data which appeared when the data did not 
contain these contaminations. (1) The age of 
the Ss has been overlooked as a relevant variable 
by most investigators with the result that there 
have been uncritical comparisons made between 
data derived from experiments using persons of 
widely divergent age groupings. (m) Intensity 
judgments have been made separately within 
the P and U categories on the assumption that 
corresponding points on the P and U intensity 
scales represent equivalent intensities. The 
study of P and U experiences with presumably 
equivalent intensities has thus led to the as- 
sumption that reported retention differentials 
could be attributed only to the q :alitative differ- 
ences inherent in P and U experiences. (n) 
Temporal sequence in the recall phase of the 
experiment is a factor which has not been given 
adequate attention in experimental methodology. 
Since the present experiment was designed to 
throw light on the problems raised by the last 
two criticisms, both need elaboration. 
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The Intensity Variable 


Jersild (5), Sharp (9), O’Kelly and 
Steckle (8), Stagner (10), and Thom- 
son (11) made no systematic study of 
intensity. Barrett (1) had Ss rate P 
and U words separately on a scale of 
one to three and concluded that the 
mean intensity of P and U words was 
the same because the mean scale 
rating of P and U words was not sig- 
nificantly different. This procedure 
is hardly justified since there is no 
way of knowing that the subjective 
one-to-three intensity scale for P 
experiences was distributed along the 
same intensity steps as the one-to- 
three scale used for judging U experi- 
ences. Cason (2) had his Ss rate 
experiences on a scale from extremely 
pleasant through indifferent to ex- 
tremely unpleasant. With such a 


procedure there is no way of deter- 
mining the extent to which an intens- 
ity on the P scale could be considered 
the equivalent of the corresponding 


point on the U intensity scale. 
Menzies’ (7) work is subject to the 
same criticism since he had five intens- 
ity steps for pleasant and five minus 
steps for unpleasant experiences but 
had no way of determining intensity 
equivalents for his P and U scales. In 
the absence of knowledge to the con- 
trary, the literal equivalence of cor- 
responding points on intensity scales 
has often been assumed. 

Perhaps the best handling of the 
intensity variable has been found in 
the work of Waters and Leeper (12). 
They used what amounted to a three- 
point intensity scale for P and for U 
experiences. By means of a refining 
procedure, which involved the utiliza- 
tion of only those experiences recalled 
the first and the second time with the 
same intensity rating, they obtained 
results free from the ambiguities char- 
acteristic of much previous work. 
Even this refinement, however, does 
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not assure the comparability of cor- 
responding points on P and U intens- 
ity scales. The fact that Ss were 
consistent from one time to the next 
in their intensity judgments of words 
points only to the reliability of judg- 
ments made separately within the P 
intensity scale and within the U in- 
tensity scale without in any way 
assuring us of comparability of cor- 
responding points on these scales. 
A possible alternative to these meth- 
ods of handling the intensity variable 
is detailed in the section on “proce- 
dure.” 


The Temporal Sequence Variable 


Previous experiments have rec- 
ognized the importance of temporal 
sequence in the learning phases by 
alternating P and U _ experiences. 
The same cannot be said of the recall 
phases of the experiments. In the 
classical experiment of Jersild (5), as 
well as in the experiments reported by 
Kowalewski (6) and Wohlgemuth 
(13), this factor was ignored as a 
variable. In the experiment of 
O’Kelly and Steckle (8) this variable 
is apparently recognized but the data 
are not analyzed to determine its im- 
portance. Moreover they do not 
report the absolute or relative size of 
the groups in which counterbalanced 
order was used so that it cannot be 
determined whether the factor of 
temporal sequence has been balanced 
out by pooling the results derived from 
the two groups. 

Stagner (10) asked his Ss to list 
their P and U experiences on separate 
sheets but gave no indication as to 
which was requested first or whether 
any consistent plan was followed. 
Young (14) asked his Ss to put 
“pleasing” words in one column and 
“displeasing” in another. For three 
groups the pleasing words were to be 
listed in the left column, and in one 
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group the situation was reversed and 
the words “pleasing” and “displeas- 
ing” were interchanged in reading the 
directions. But since the Ss were 
permitted to recall words in whatever 
temporal sequence they chose, there 
is no way of knowing whether the 
effects of a temporal sequence were 
balanced out. Unless there are speci- 
fic directions to the contrary, there is 
a tendency to recall pleasant words 
first and then unpleasant, probably 
because the phrase “pleasant-unpleas- 
ant” suggests this sequence of opera- 
tions. The fact that most Es have 
followed this “P then U” sequence, 
testifies to the validity of this assump- 
tion. The following section on “pro- 


cedure” describes a method of in- 
vestigating the importance of the 
temporal sequence variable. 


PROCEDURE 


The procedures followed in conducting this 
experiment will be given fully in order that the 
results may be capable of interpretation by those 
who choose to define terms operationally. A 
total of 250 students taking introductory psy- 
chology served as Ss. In order to study the 
intensity variable and the temporal sequence 
variable, these Ss were divided into three groups. 
Table I indicates the procedures and the number 
of Ss in each group. 


TABLE I 


EXPERIMENTAL CONDITIONS 








No. of Subjects 





Group Instructions 
Listing| Listing 
P First| U First| 1°! 





Rank P experiences in 
intensity from most P = 
as No. 1 to least Pas| 34 23 57 
N. Same with U ex- 
periences. 


Same procedure as for 
Goup A. 67 59 


Rank all experiences, 
whether P or U, on 
one scale, from most 
intense as No. 1 to 
least intense as N. 
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After distributing papers bearing identifying 
numbers, the instructions to the Ss at the time 
of the first (original) recall were given as follows: 

The E said to all groups: “For a few minutes 
this morning we are going to conduct a survey of 
your recent emotionally tinged experiences. 
You will be given 4 min. in which to record all 
of the most (pleasant or unpleasant in accord- 
ance with Table I) experiences you have had 
during the past week. It is not necessary for 
you to write out a complete description of each 
incident but only a sentence or perhaps a word 
or phrase which will permit you to identify the 
experience later. Be as frank and as complete 
as possible in the time allotted. As your names 
are not written on these papers, your responses 
are completely anonymous.” 

After a pause of 4 min., E said to Groups A 
and B only: “‘Now please rank these experiences 
in intensity from most (unpleasant or pleasant) 
as number one to the least (unpleasant or pleas- 
ant).” 

All groups, including Group C, were then 
given 4 min. in which to record the experiences 
not asked for the first time, i.e., P if U had been 
asked for the first time, and vice versa. Groups 
A and B then ranked these in intensity. 

The E then said to Group C only: “Now 
please rank all of these experiences in intensity 
from the most intense, as number one, to the 
least intense. For example, if the pleasant ‘date’ 
last Friday night was the most intense experience 
you have recalled, it would be ranked number 
one. Ifthe unpleasant history bluebook was the 
second most intense, it would be ranked as 
number two, etc.” 

Fourteen days later, two sets of papers were 
prepared having the same identifying numbers as 
those used on the original papers. One set was 
blank while each paper of the second set con- 
tained all of the original responses made by a 
specific S in random order with P and U re- 
sponses intermixed. Two additional experi- 
ences were added to those supplied previously by 
S. These were modified in all cases to fit the 
general tense and form of the other responses on 
each individual S’s paper. 

The instructions to all Ss were as follows: 
“Come forward and find the paper which has 
your number on it (offering a stack of papers 
that were blank except for the identifying num- 
bers). First you will be given 2 min. in which to 
list all of the (pleasant or unpleasant) experiences 
you recorded originally that you can now recall.” 
(Pause of 2 min.) “Now please list all of the 
(unpleasant or pleasant) experiences you recorded 
originally that you can now recall.” (Pause of 
2 min.) 

The papers were collected and these instruc- 
tions were given: “Please take the paper which 
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has your number on it (indicating the papers 
containing the listing of their original responses). 
The experiences which you recorded originally 
have been re-arranged and several others have 
been added. Please identify all the experiences 
on your list which you are reasonably certain you 
put on your original paper by placing a ‘P’ before 
it if you remember it now as pleasant or a ‘U’ if 
you remember it now as unpleasant. If you 
recognize any experiences as those you recorded 
but no longer feel that they are particularly 
pleasant or unpleasant, please mark them with an 
‘N.’ Do not mark any experiences that you do 
not believe you recorded on the paper two weeks 
ago.” 

Then £ said to Groups A and B only: “Now 
please rank the pleasant experiences—those you 
have marked ‘P’—from the most pleasant as 
number one to the least pleasant as you now feel 
about them. (Pause.) Follow the same proced- 
ure with the unpleasant experiences, ranking 
them from number one as most unpleasant to 
least unpleasant.” 

The E said to Group C only: “Now please 
rank all experiences from most intense as number 
one to least intense as you now feel about them. 
For example, if the pleasant ‘date’ was the 
most intense experience, it would be ranked as 
number one. If the unpleasant history blue- 
book was the second most intense, it would be 
number two, etc.” _ 


REsULTs 


The utilization of personal experi- 
ences avoids the criticism that at- 
taches to the frequent use of word lists 
but there still remains the question as 
to whether the assurance of anonym- 
ity was sufficient to permit the recall 
of material sufficiently related to per- 
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Schoolwork and grades 3. 5 13.2 
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TABLE III 


Memory FoR PLEASANT AND UNPLEASANT 
EXPERIENCES WITHIN Two CoNnTRASTING 
ReEcaLt SEQUENCES 








Pleasant Responses Unpleasant Responses 





Group! Original % Original 
No. called No. 
(ist Re- | (2nd Re- (ist Re- 
call) call) call) 


% Re- 
called 
(2nd Re- 
call) 





Pleasant Experiences Recorded First 





| | 

A 40.1 | 158 | 
B 56.1 | 287 
C 56.2 | 122 

ee 51.6 567 | 








Unpleasant Experiences Recorded First 





| 148 s¢ | 103 
315 5 | 279 
i 193 

| 


575 


245 
708 


e255 


Total 








Combined 





Total 49.6 


| | 
1535 | 46.2 | 1142 
! 





sonal life experiences to have genuine 
affective accompaniments. In Table 
II the experiences recalled are classi- 
fied in an attempt to answer this 
question. 

It would appear that a variety of 
genuine personal experiences have 
been tapped and that many of them 
have the affective accompaniments 
that one presumes should accompany 
genuine P and U experiences. The 
rather large number of unclassified 
responses resulted from our suggestion 
that S might use a word or phrase 
which would recall the incident to 
him but conceal its nature from us. 
It is assumed that a great many of 
these responses were of a personal 
nature, and the fact that many chose 
to use this protective device encour- 
ages the belief that the responses were 
personal and involved a substantial 
affective element. 
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TABLE IV 


Numser or P anp U Responses 
ON THE SEcoND RECALL 








Condition Mean 








P responses 
(U requested first) 2.35 
P responses 
(P requested first) 3.28 
U responses 


(U requested first) 2.61 








U responses 
(P requested first) 








1.95 i283 








Table III gives the number of P 
and U experiences recalled originally 
and at the second recall for all groups. 
While analysis of such data might 
well avoid percentage calculations, 
this practice is continued because 
previous research with which ours is 
compared was reported in this man- 
ner. 

The influence of temporal sequence 
is apparent. When P experiences 
were recalled first, 51.6 per cent of the 
previously reported P experiences 
were recalled. But when the U ex- 
periences were recalled first, only 39.8 
per cent of the previously reported P 
experiences were recalled. In the 
case of the U experiences, although 
54.4 per cent of the previously re- 
ported U experiences were recalled 
when U experiences were recalled 
first, only 44.6 per cent were recalled 
when U experiences were recalled 
second. 

To test the statistical significance of 
temporal sequence the number of re- 
sponses made by each S was treated 
as a score and the distribution of these 
scores for the several ‘conditions was 
obtained. From these distributions, 
the data in Table IV were derived. 
The magnitude of the critical ratios 
indicates the importance of temporal 
sequence as a variable. It is hardly 
surprising that Jersild (5) and others 
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who asked for pleasant experiences 
first should obtain results indicating 
the more favorable retention of pleas- 
ant experiences. 

Aside from its importance in the 
context of experimental control, the 
reason for this finding can only be 
speculated upon. Greater recall de- 
crement in the second half of the recall 
period might suggest that fluctuation 
of attention, fatigue, monotony, or 
boredom are entering in. Or it may 
be suggested that the first portion of 
the recall period sets up proactive in- 
hibitory influences which inhibit re- 
call during the latter portion. This 
experiment does not provide definitive 
information on this point. 

Table III shows that when the 
effect of temporal sequence is mini- 
mized by combining all data for P 
experiences, whether recalled first or 
second, 46.2 per cent of the P experi- 
ences were recalled while 49.6 per cent 
of the U were recalled. It would ap- 
pear that when the effect of temporal 
sequence is minimized, there is no 
clear-cut evidence that the type of 
affective experience is related to the 
ability to remember such experiences. 
The term “minimized” is used in rec- 
ognition of the fact that the experi- 
mental groups combined to minimize 
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Fic. 1. Second recall as a function of intens- 
ity ratings made at time of first recall by groups 
A and B. 
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this effect were not absolutely identi- 
cal in size and a negligible error re- 
sults. 

Figure 1 permits a comparison of 
the recall of P and U experiences for 
various degrees of intensity for Groups 
Aand B. It is evident that the recall 
of both P and U experiences is mark- 
edly related to the intensity rating 
given these experiences by the Ss. 
Figure 1 indicates that recall drops 
off rapidly as one deals with experi- 
ences several intensity steps removed 
from the most intense experiences. 
The fact that the curves for P and U 
experiences are very nearly superim- 
posed reflects our finding that when 
temporal sequence is minimized by 
using a counterbalanced order, P and 
U experiences are recalled with nearly 
equal facility. Figure 1 permits us to 
generalize this finding by showing that 
it holds for all levels of intensity in- 
vestigated. The total number of P 
experiences recalled on the first recall 
varied from 183 for those having an 


intensity rank of 1 to 10 for those 


having an intensity rank of 1l. The 
number of U experiences varied from 
182 for an intensity rank of 1 to 4 for 
an intensity rank of 9. 

Figure 2 depicts the relationship 
between incidence of recall and intens- 
ity of experience (regardless of P or 
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Fic. 2. Second recall as a function of intens- 
ity ratings made at time of first recall by Group 
C (P and U experiences rated together in in- 
tensity). 
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U) as judged at the time of the origi- 
nal recall. A similar figure results 
when intensity, as judged at the time 
of second recall, is plotted against 
incidence of recall. The total num- 
ber of experiences recalled on the 
first recall varied from 67 for those 
having an intensity rank of 1 to 11 
for those having an intensity rank of 
15. 

These data are of interest for sev- 
eral reasons. They indicate that Ss 
are capable of performing a set of 
operations in which they are required 
to disregard the qualitative nature of 
the experiences and attend primarily 
to the intensity of the experiences. 
Only four Ss had difficulty doing this, 
and it was due principally to a mis- 
understanding of directions. That 
primary attention was given to in- 
tensity rather than the qualitative 
nature of the experience was indicated 
by analyzing separately the intensity 
ratings given to P and U experiences. 
In this analysis, the first most intense 
P experience was assigned intensity 
rank No. | and the next most intense 
P experience was given intensity rank 
No. 2 regardless of the U experiences 
the S had interposed between them 
in making the combined ranking. 
The same was done for the U experi- 
ences. In this way the data were 
transformed so that they could be 
compared with the data obtained 
from Groups A and B where the P 
and U experiences had been sepa- 
rately ranked in intensity by the Ss. 
When this was done, the data looked 
approximately like those already pre- 
sented for Groups A and B. This is 
taken as evidence that the making of 
intensity ranks of mixed P and U 
experiences, disregarding differences 
of qualitative nature, does no violence 
to judgmental processes already ac- 
cepted as valid for the separate rating 
of P and U experiences. The effect 
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is one of distributing these qualita- 
tively different experiences along the 
same intensity continuum without ap- 
preciably altering the intensity posi- 
tions they would be assigned if they 
were to be judged by comparing them 
only with qualitatively similar experi- 
ences in the usual fashion. 

This conclusion has practical impli- 
cations for any attempt to determine 
the relationship between incidence of 
recall and affective tone with intensity 
held constant. Past attempts have 
characteristically shown incidence of 
recall for P and U experiences plotted 
in the same figure against a common 
intensity scale. This technique 
makes the unwarranted assumption 
that the same numerical values on the 
P and U intensity scales represent 
intensity equivalents. Since ortho- 
dox methodology has required the 
ranking of P and U experiences in 
separate operations, this assumption 
is highly questionable. The proced- 
ure described here appears to meet 
this objection, and Fig. 3 gives inci- 
dence of recall for P and U experi- 
ences where the intensity ratings of 
both types of experience were made 
on the same intensity scale and then 
intensity equivalents for P and U ex- 
periences were derived in the manner 
just described. 

Assuming that P and U experiences 
alternate with each other, the mean 
discrepancy in intensity between simi- 
larly ranked P and U experiences 
plotted on the same intensity scale 
could be no greater than the intensity 
difference between adjacent ranks on 
the combined intensity scale. Since 
the procedure of ranking both P and 
U experiences on the same intensity 
scale divides the intensity range into 
approximately twice the usual num- 
ber of divisions, the mean intensity 
difference between ranks on the com- 
bined scale is considerably less than 
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Fic. 3. Recall of P and U experiences with 
the factor of intensity approximately equated at 
each intensity level (Group C) 


the difference between ranks on either 
the P or the U scale where intensity is 
ranked independently. The error is 
actually much less than this since 
many Ss are used, and approximately 
half the P ranks will be half an intens- 
ity step higher than similarly ranked U 
experiences while half will be half an 
intensity step lower than similarly 
ranked U experiences. Occasionally, 
the alternation of P and U experiences 
on a combined intensity scale will be 
interrupted by a sequence of several 
Por Uexperiences. In this event, the 
required degree of accuracy can be 
achieved by utilizing only those Ss 
having P and U experiences for which 
there is always an adjacent experi- 
ence belonging to the other qualitative 
category. The shrinkage of data re- 
sulting from this latter step would 
necessitate more Ss than were used 
in the present experiment. 


SuMMARY AND CONCLUSIONS 


Despite methodological advances in 
memory experiments, there remain 
two aspects of methodology which 
have not received the proper atten- 
tion: (a) the importance of the tem- 
poral sequence in which P and U ex- 
periences are requested has not been 
adequately studied; and (b) the 
methods of equating the intensity of 
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P and U experiences have been based 
on the questionable assumption that 
similar intensity ratings on two sepa- 
rate scales (P and U) represent inten- 
sity equivalents. 

The importance of sequence of re- 
call was demonstrated by having some 
groups recall P experiences first while 
other groups recalled U experiences 
first. When P experiences were re- 
called first, a higher percentage of P 
than U experiences was recalled. 
When U experiences were recalled 
first, a higher percentage of U than P 
experiences was recalled. The num- 
ber of responses made by each S under 
each of these experimental conditions 
provided score distributions which 
differed significantly. When the 
factor of temporal sequence in recall 
was minimized by using a counter- 
balanced order, there was little differ- 
ence found in the ability to recall P 
as opposed to U experiences. 

In order to equate intensity ratings 
of P and U experiences, it was found 


possible to have Ss rate these experi- 
ences upon the same intensity scale 
by having them rank the most intense 
experience, whether P or U, as No. 1, 
and the next most intense experience, 


whether P or U, as No. 2, etc. Rea- 
sons are presented for believing that 
this procedure permits a much closer 
approximation to the establishment of 
intensity equivalents than does the 
usual procedure of rating intensity 
independently for P and U experi- 
ences. 

Recall was found to be a function 
of intensity of the experience, whether 
Por U. With the factor of intensity 
approximately equated at each in- 
tensity step, the percentage of P and 
U experiences recalled was plotted 
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against intensity. The resulting 
curves for P and U experiences diff- 
ered only slightly. 


(Manuscript received October 2, 1950) 
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MEDIATED STIMULUS GENERALIZATION AS A FACTOR 
IN SENSORY PRE-CONDITIONING 


BY DELOS D. WICKENS 
Ohio State University’ 


AND 
GEORGE E. BRIGGS 


University of Wisconsin 


The paradigm for sensory pre-con- 
ditioning experiments is as follows: 
The S is simultaneously stimulated 
with two neutral stimuli (e.g., light 
and tone). Following a number of 
such pairings one of these stimuli is 
made the conditioned stimulus for 
some response. Then in the crucial 
test the other or “non-conditioned” 
stimulus is presented. A majority of 
Ss so treated will give a conditioned 
response to the “non-conditioned” 
stimulus, while control Ss who do not 
receive the pretraining fail to do so 
(1, 2, 3, 4). 

Taken at face value, these experi- 
ments would seem to offer evidence 
against an S-R approach. They ap- 
pear to do so because the pre-condi- 
tioning procedure seems to consist 
solely of an association of stimuli. 
The learning would appear to be of 
an S-S nature rather than an S-R 
nature. The S-R theorist may point 
out, however, that some response 
must be made to these paired stimuli. 
What is even more significant is the 
fact that, because the stimuli occur 
simultaneously, the identical response 
or responses are given to both stimuli. 
The responses will vary from trial to 
trial, but on any one trial the response 
that occurs following one stimulus 
also occurs following the cther.? If 


1 This research was supported in part by a 
grant from The Graduate School of The Ohio 
State University. 

2 This theoretical analysis, although it is S-R 
in nature, does not necessarily assume reinforce- 
ment as essential for S-R formation. It is quite 


this be true then sensory pre-condi- 
tioning becomes a special case of 
mediated stimulus generalization (6). 
The transfer that occurs may be con- 
sidered as arising not from the fact 
that the two stimuli had been associ- 
ated together, but because a common 
and mediating response had been 
learned to both. 

An indirect method of testing this 
hypothesis is to train Ss to make the 
same response to the two stimuli pre- 
sented in isolation and compare the 
transfer value with that obtained from 
Ss who make the same response when 
the stimuli are presented together. 
The hypothesis would predict that 
these two groups of Ss would behave 
in essentially the same fashion in the 
crucial tests. The control Ss would 
be required to learn a different re- 
sponse to the two stimuli presented in 
isolation. These Ss should show few 
if any responses in the crucial tests. 
This is the method of evaluating the 
hypothesis which was employed in 
the present experiment. In brief, the 
problem was to investigate the hy- 
pothesis that sensory pre-conditioning 
may be a special case of mediated 
stimulus generalization. 


METHOD 


Subjects.—Forty students from the elementary 
psychology classes at Ohio State University 


possible that the reinforcement theorist could 
find evidence of reinforcement in the pre-condi- 
tioning procedure. The present experiment 
does not deal with this problem in any way. 
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served in this experiment. They were selected 
at random from the class population and as- 
signed at random to each of the four groups em- 
ployed in the experimental design. 

Apparatus.—The S was placed in a semi- 
soundproofed room which was adjacent to the 
control room. The experimental room con- 
tained a standard classroom chair with a finger 
avoidance conditioning apparatus placed on S’s 
right and a light source placed at approximate 
eye level when S was seated. A 60-w. light 
provided illumination during the instruction 
periods only. 

The palm of S’s right hand was strapped to a 
large copper plate and the middle finger of that 
hand rested upon another small copper platform. 
During the conditioning trials, this arrangement 
formed a circuit which wouid result in the receipt 
of an electric shock if S’s finger did not fly up and 
break the circuit. 

A standard telegraph key was placed in an 
inverted position just above the finger which was 
resting on the small copper plate. If S raised 
his finger approximately } in., the key was closed 
and no shock could be transmitted through the 
circuit. Breaking the circuit stopped a Standard 
Electric Time Clock, which started with the 
onset of any stimulation. This chronometer 
measured response latency in units of 1/100 sec. 

The tone had a frequency of 1,000 cps and 
was produced by a Jackson Audio-Oscillator, 
Model No. 655. This tone was transmitted to 
S through the headphones. The intensity was 
approximately 40 db above normal threshold. 
The light stimulation was initiated from a DC 
power source which activated a 6-8 v. bulb. 
This power source also supplied the shock used 
in the conditioning trials, and this shock was 
variable up to 100 v. An electronic delay ap- 
paratus controlled the temporal relationship be- 
tween the CS and the UCS. 

General procedure.—The S was first brought 
into the experimental room, seated, and given 
the pre-conditioning training to the light and the 
tone. This constituted step one. In step two, 
S was given 30 finger avoidance-conditioning 
trials with the tone as CS and the shock as UCS. 
The third step consisted of ten trials in which 
the light was presented alone. This is, of 
course, the crucial test period for sensory pre- 
conditioning. The fourth and final step which 
followed immediately consisted of ten additional 
trials to the tone. This last step was to test for 
possible extinction of the conditioned response. 
The total time required for the entire session was 
approximately 45 min. 

Pre-conditioning period—The treatment of 
the Ss in each group was exactly the same with 
the exception of step one, the pre-conditioning 
training. Group I made a common response to 
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the stimuli when they appeared simultaneously 
in the pre-conditioning series. Group II made 
a common response to each of the two stimuli 
when they appeared separated in time. Group 
III made the response to only one of the stimuli 
(the light) and not to the other (the tone). 
Group IV made the response to the tone and not 
to the light, thus making this group comparable 
to the third group in that the response was made 
only to one of the two stimuli when they ap- 
peared separately in time. 

The common response used by all groups was 
the verbalization of the word “now.” The in- 
structions given to Group I for the pre-condition- 
ing period were as follows: 


This first section of the experiment is what 
is called a reaction time experiment. Dur- 
ing this section, this light and a tone from 
the headphones which you will be wearing, 
will come on at exactly the sametime. Just 
as s00n as you are sure that you see the light 
and hear the tone, I want you to respond by 
saying the word “now” in a loud enough 
tone of voice so that I can hear you in the 
next room. This will give me an indication 
of your speed of reacting to each of these two 
stimuli, or in other words, I will have a meas- 
ure of your reaction time. This will take 
approximately 5 min. and then I will give 
you the instructions for the second part of 
the experiment. 


The instructions given to the remaining groups 
were the same except for the modification re- 


quired by the difference in treatment. Group 
II was told that they would hear a light some- 
times and a tone other times and that they were 
to respond to either. Groups III and IV were 
told to respond to one of the stimuli and not to 
the other. 

There were 15 trials of the light and tone for 
Group I, and for Groups II, III and IV there 
were 30 trials, 15 tones and 15 lights. As indi- 
cated in the instructions, it was assumed that 
Ss of Group I would make the 15 responses to 
both the light and the tone, and thus Groups II- 
IV were assumed to be comparable in terms of 
the number of reactions made to each stimulus. 

The interval between trials in the pre-condi- 
tioning section varied randomly from 12 to 24 
sec. In Groups II-IV the light or tone were 
presented randomly. No actual measure of 
reaction time was taken during these trials. 

Conditioning period.—Following the last trial 
of the first session, E returned to the experi- 
mental room and read the following instructions 
to all Ss. 


This part of the experiment is to be a con- 
ditioning study. During the experiment, 
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your right hand will be strapped to this elec- 
trode, and your middle finger will rest on 
this electrode underneath the key. 

You will receive a shock which will pass 
from the front electrode to the one in the 
rear through your finger. However, you 
can break the shock by lifting your finger 
from the front electrode and closing the key 
in this manner. 

During the experiment, a tone will precede 
the shock by a regular interval of time, and 
if you get conditioned, you will develop a 
tendency to respond to the tone before the 
shock goes on. I do not want you to fight 
against becoming conditioned, and at the 
same time, I do not want you to respond 
voluntarily to the tone. If your finger 
wants to fly up, just don’t inhibit it. 


The S’s right hand was then placed in the 
apparatus and the key was adjusted so that 
about } in. upward movement closed the circuit. 
The E then left the room and the conditioning 
trials were begun. The time intervals between 
the trials for conditioning varied in a random 
manner from 15 to 45 sec. Thirty reinforce- 
ments were given. 

Test for sensory pre-conditioning.—The final 
test period consisted of ten presentations of the 
light alone, the stimulus which had not been 
directly employed in the conditioning process. 
This was followed by ten final trials to the tone, 
the directly conditioned stimulus. There was 
no temporal break between the conditioning and 
the test series. The time intervals for this test 
were exactly the same as for the conditioning 
trials. 


RESULTS 


In Table I is presented the mean 
number of responses during the crucial 
test to the non-conditioned stimulus, 
and standard deviations for each of 
the four groups. All differences be- 
tween means are significant beyond 
the .01 level except for the differences 


TABLE I 


Means ANp SD’s or Responses MapE 
DURING THE TRANSFER TEST 








Pre-conditioning Group Mean 





I. Tone-light together 6 
II. Tone-light separately Jl 
III. Light only 1 
IV. Tone only 2 
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between Group I and II (t = .34) and 
between Groups III and IV (t = 2.04). 


Discussion 


It was predicted that the group 
which made the response “now’”’ to 
both stimuli when they appeared 
simultaneously would give signifi- 
cantly more evidence of sensory pre- 
conditioning than would Groups III 
and IV where this response was made 
to but one of the two stimuli. This 
was confirmed at the .01 (I-III) and 
the .001 (I-IV) levels of confidence. 

Secondly, it was predicted that 
sensory pre-conditioning would occur 
with high frequency, even though the 
stimuli were separated in time if the 
the same reaction were made to both 
stimuli (Group II). The results sup- 
port the prediction in that (a) there 
is no appreciable or significant differ- 
ence between Group II and Group I, 
and (b) in that the frequency of con- 
ditioned responses in Group II is sig- 
nificantly greater (p = .01 and .001) 
than in Groups III and IV where the 
stimuli were also separated in time 
but a common response was not made 
to both stimuli. 

The procedure employed for Group 
II is the paradigm for mediated stimu- 
lus generalization (5). This involves 
training S to make the same response 
to two independent stimuli; condition- 
ing another response to one of these 
stimuli; then testing for the occurrence 
of this newly trained response to the 
other stimulus. Our logical analysis 
has pointed out that in the typical 
sensory pre-conditioning situation 
these conditions are met, save that in 
this instance the stimuli are presented 
together rather than independently. 
This anzlysis, together with _ the 
empirical results, offers some justifica- 
tion for the interpretation of sensory 
pre-conditioning as a form of mediated 
stimulus generalization. 
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SUMMARY 


Four groups of Ss were run to test 
the hypothesis that sensory pre-con- 
ditioning is a special case of mediated 
generalization. 

The procedure was as follows: For 
Group I a tone and light were paired 
together for 15 trials and Ss were re- 
quired to say “now” when the stimuli 
appeared. For Group II the two 
stimuli were presented independently 
(separated in time) and the same re- 
sponse (“now”) was given to both 
stimuli. Fifteen presentations of 
each stimuli were given. For 
Groups III and IV the verbal response 
was given to one, but not to the other 
stimulus. 

Groups I and II showed a high fre- 
quency of response transfer or sensory 
pre-conditioning, and they did not 
differ significantly from each other. 
Groups III and IV showed a low level 
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of response transfer. These groups 
did not differ significantly from each 
other, but both groups differed sig- 
nificantly from Groups I and II. 

The empirical results support the 
hypothesis which views sensory pre- 
conditioning as a special case of medi- 
ated stimulus generalization. 


(Manuscript received October 20, 1950) 
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THE ROLES OF PATTERN AND APPARENT DISTANCE IN 
DETERMINING THE COLOR OF AREAS SEEN 
THROUGH TRANSPARENCIES 
BY W. C. H. PRENTICE, JOSEPHINE KRIMSKY, AND STEPHEN BARKER 


Swarthmore College 


One of the recurrent problems in the 
visual perception of color, has been the 
apparent anomaly of color “trans- 
formations” under conditions of trans- 
parency. The world seen through 
rose-colored glasses is said to maintain 
a remarkably normal look after all. 
And in the laboratory situation, if we 
examine the apparent color of a yellow 
disk seen through a film of blue (which 
may be produced either by a gelatin or 
by an episcotister) we see what seems 
to be a violation of the laws of color 
mixture. That is, the yellow and the 
blue may be chosen so as to be com- 
plementary, and yet the yellow patch 
still appears yellow while it is being 
retinally mixed with the blue of the 


filter or episcotister. 

Nearly 30 years ago, Fuchs (1, 2) 
published two papers in which he dis- 
cussed certain figural influences upon 
this kind of transformation and allied 


phenomena. Five years later, Tudor- 
Hart (4) investigated certain aspects 
of what Fuchs called “assimilation” 
with what purported to be quantita- 
tive methods. Fuchs’ own investiga- 
tions had been almost entirely matters 
of qualitative observation and de- 
scription. Finally, in 1933, Heider 
(3) published a series of studies in 
which she reported further on the 
assimilation effect. 

Assimilation is the tendency for a 
particular patch of color to take on 
the appearance proper to other por- 
tions of a larger figure of which the 
patch is a part. In the case under 
consideration, a yellow surface par- 
tially masked by a blue transparency, 


that part of the yellow that is unob- 
scured is said to give a yellow appear- 
ance to the (retinally gray) part seen 
through the blue. Heider reports 
evidence that the assimilation effects 
are stronger the stronger the organiza- 
tion of the figure in which the patch 
is imbedded. 

She also reports that the “yellow- 
ness” of the figure-color may be en- 
hanced by increasing the separation 
between the plane of the transparency 
and the plane of the figure. This is 
a kind of “constancy” effect, said to 
be independent of assimilation. The 
fact that one portion of the visual 
field can be seen to lie behind another 
contributes, it is said, to seeing the 
objective (rather than the retinal) 
colors. The present paper reports the 
results of attempts to repeat Heider’s 
experiment with more carefully de- 
fined and controlled conditions. 

Reading Heider’s article today, it is 
quite clear that the experiments are 
“quantitative” only in a very limited 
sense of the word. A portion of the 
article concerns experiments which in- 
volved no measurement of any sort. 
The cases in which some form of 
measuring was actually carried out 
are, moreover, subject to various 
questions and criticisms. The num- 
ber of Os is usually too small; the Os 
are frequently not naive; and the 
statistics published are not sufficient 
to enable the reader to form any judg- 
meni about the significance of differ- 
ences. Finally, the measures are all 
stated in terms of the percentage of 
blue and yellow that satisfied the O, 
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despite the fact that the particular 
blue and yellow used were not equal 
in brightness; the result being that 
brightness judgments and hue judg- 
ments were necessarily confounded. 
For these reasons, and also for certain 
theoretical reasons, we determined to 
investigate the original contentions of 
Fuchs, Tudor-Hart, and Heider to 
see whether we could reproduce the 
results when we paid more careful 
attention to such factors as stimulus 
control in hue and intensity, sampling 
of Os, and adequate psychophysical 
methods. 


PROCEDURE 


The O sat and looked successively through 
two holes, each 3} in. in diameter, in a gray 
screen placed some 2 ft. in front of his eyes (see 
Fig. 1). Behind one hole was an adjustable 
color wheel with the standard Maxwell dise ar- 
rangement and a protractor on the back enabling 
E to shift rapidly and accurately from one setting 
to another. Behind the other hole in the screen 
stood a blue episcotister which could be raised or 
lowered. Through the episcotister a yellow 
figure (equal in brightness to the blue of the epi- 
scotister) could be seen -4 ft. away on a gray 
background. 

Four different arrangements were used. In 
Condition A, yellow arcs were pasted on the epi- 
scotister in such a way as to form part of a circle 
whose center was the center of the episcotister. 


1 
- 











la 1b 


Fic. 1. Part la is the screen that confronted 
O: he saw at A the various combinations of yel- 
low figure and blue episcotister and at B a 
Maxwell disc color mixer. Part 1b is a schematic 
view of the experimental room: (’s chair is at A; 
the screen of Part la is at B; C and D represent 
the color mixer and the episcotister, respectively; 
and E represents the screen on which the figures 
for observation were hung. 
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Fic. 2. The Wallach episcotister. Arcs of 
cardboard are pasted into place as shown so that 
the entire assembly can be spun on a color wheel, 
giving the same retinal stimulation as is given 
by a full circle of yellow seen through a blue 
episcotister. 


This arrangement (which we will refer to here- 
after as the Wallach episcotister) is shown in 
Fig. 2. The gray screen, 4 ft. away, remained 
blank in this condition. In Condition B, a yel- 
low ring was presented on the distant screen 
completely covered by the area of the episco- 
tister, so that O saw an annulus filled with and 
surrounded by blue in this condition as in Con- 
dition A, the only difference being that in Con- 
dition A, the annulus and surround lay in the 
same plane, and in Condition B, the annulus was 
visibly lying on a surface 4 ft. behind a blue 
transparency. In Conditions C and D, the epi- 
scotister was lowered so that it filled only half 
the opening of the nearscreen. In C, the ring of 
Condition B was again used, half of it being seen 
through the blue episcotister and half, unob- 
secured, above. In D, two yellow arcs, both 
curving the same way, were used, one appearing 
through and the other above the episcotister. 
In both C and D, it was apparent that the yellow 
figures lay in the same plane. 

The color wheel was composed of -the same 
yellow and blue papers of equal brightness 
(Munsell 5/5-Y and 5/5-PB). The episcotister 
was set at 252° blue, so that when the yellow 
figure behind the blue was viewed through a re- 
duction screen, it looked like a slightly bluish 
gray. It was intentionally made slightly bluish 
in order to counteract some of the contrast effect, 
since a gray figure surrounded by blue will al- 
ways take on some degree of yellow appearance. 
This tendency, operatirg as it does to give results 
like those predicted on grounds of transparency 
effects or assimilation, must be given special 
consideration and control. The daylight illu- 
mination on each of the experimental surfaces 
(the screen in front of 0, the Maxwell disc, and 
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the screen at a distance of 4 ft.) was equated by 
means of a Macbeth illuminometer. 

Conditions A and B were introduced to test 
the effect on color of transparency, i.e., of having 
the same area on the retina represent two sur- 
faces of different colors and lying in separate 
planes. In A there could be no separation of 
planes, but the same color mixture stimulated 
the retina in this and the other conditions. Be- 
cause of the 4-ft. distance between the epi- 
scotister and the screen, a very good transpar- 
ency condition existed in Condition B, and O 
saw the yellow ring lying four feet behind the 
plane of the blue transparency. Conditions C 
and D served to test the effect (on apparent 
hue) of two different degrees of figural cohesive- 
ness between inlying and outlying portions of the 
figure. According to Heider’s analysis of the 
situation, the ring should be yellower in Condi- 
tion B than in Condition A; and in C the portion 
seen through the episcotiste: should be yellower 
than the less cohesive figure of Cond. D. Ofall 
the conditions, Cond. A should presumably be 
the bluest. 

The O looked at the opening in the near screen 
and saw a yellow circle surrounded by blue (in 
Cond. C and D a yellow semi-circle surrounded 
by blue and another outside the blue, surrounded 
by gray) and then had to judge whether the 
yellow was more or less yellow than a comparison 
mixture seen through another hole in the same 
screen. 

Fifteen Os were used. ‘The order of Cond. A, 
B, C, and D was counterbalanced for different 
Os in order to control possible adaptation effects 
due to the length of time of the observations. 
Each O made a total of about 30 judgments for 
each condition in the course of determining six 
limits (using 10° step-intervals), three from blue 
to yellow and three from yellow to blue in 
ABBAAB order. A_ two-category response 
(“too blue” or “too yellow”) was required at all 
times. Between judgments, O looked away 
from the screen in order to minimize adaptation 
and satiation effects and to prevent him from 
seeing the new setting on the color wheel before 
it was started spinning. Most Os had had some 
previous experience making psychophysical 
judgments. Most, but not all, Os knew some- 
thing about the purpose of the experiment; no 
differences are apparent in the performance of 
our sophisticated and our few naive Os. 


ReEsuLTs 


In Table I are presented the mean 
number of degrees of blue (six limits 
for 15 Os) required to match the hue 


’ 


of the “yellow” figure under each 
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TABLE I 


Mean Numer or Decrees or BLvE 
Requirep To Matcu tue Hve or 
THE “YELLOW” FIGURE UNDER 
Four ConpitTions 








Mean SD 


20.5 
20.0 
16.9 
19.0 


Condition 
A. Wallach episcotister 
B. Obscured ring 
C. Half-obscured ring 
D. Broken ring 





128.7 
133.8 
132.7 
132.3 





condition. None of the differences 
approaches statistical significance. If 
one anomalous O is omitted, the re- 
sults become even more nearly the 
same, namely 132.6, 131.9, 132.7, and 
132.3 degrees under Cond. A through 
D, respectively. 


Discussion 


According to these results, it would 
appear that under our conditions no 
color constancy occurs beyond the 
yellowing predicted from knowledge 
of the contrast conditions in the con- 
trol Cond. A. It may be that this 
difference between our findings and 
those of Fuchs and Heider should be 
attributed to the larger number of Os 
used here, as well as to more careful 
control of illumination and brightness. 
Preliminary experiments have shown 
that any difference in the last vari- 
ables may cause a significant differ- 
ence in color judgments. It can be 
seen that in Cond. A, where there is 
no possible separation of planes, the 
color wheel setting is about 130° of 
blue, while the yellow figure with which 
it is being equated is actually seen 
through episcotister segments totaling 
252°. This difference of over 100° 
appears to result from the well-known 
influence of simultaneous contrast 
between the “yellow” ring and the 
inducing blue field of the episcotister. 
In Cond. B the same difference exists, 
and without Cond. A as a control 
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might have been explained by Fuchs 
and Heider as an example of color con- 
stancy or transformation due to the 
separation of planes. The usual 
method of getting rid of the separation 
of planes has been to use a reduction 
screen. Actually, however, what this 
accomplishes is more than just the 
pulling together of the colors into one 
plane. It also succeeds in excluding 
the blue surround of the mixture color. 
In our Cond. A, we have kept every- 
thing constant except the one factor 
of the separation of planes. It is 
now perfectly clear, as indeed an 
experiment of Wallach (5) indicated 
earlier, that the condition of trans- 
parency does not affect the apparent 
hue of the figure, which is instead 
wholly determined by contrast in- 
fluences. 


The question remains, how did it happen that 
Heider thought she had found a result that can- 
not be repeated under our conditions? The 
most reasonable explanation appears to be that 


she probably did not maintain precisely equal 


illuminations on the two surfaces. That is to 
say, the match which an O will make when look- 
ing at our Cond. B will be the same as that in 
Cond. A only if the total illumination on the two 
yellow rings is the same. The apparent hue of 
the yellow ring is especially sensitive to very 
small changes in illumination. We have found, 
for example, that rather minor adjustments in 
the position of a window shade will make the 
yellow ring appear either much yellower or much 
grayer. If, for some reason, Heider had a set-up 
in which the farther surface was more highly illu- 
minated, the proper proportional intensities of 
blue and yellow on the retina would obviously 
have been disturbed. 

It is rather more difficult to explain how 
Heider was able to get her reported results con- 
cerning the relationship between the cohesive- 
ness of the figure and the apparent hue. Her 
own data are not particularly convincing in this 
respect. She failed to report the data in a way 
which would permit estimation of the signifi- 
cance of differences, and it is certainly not ap- 
parent that the differences she reports are sta- 
tistically significant. Since we have been unable 
to find even the small differences that she re- 
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ports, we are inclined to believe that Heider’s 
original results are a matter of sampling error of 
some sort. 


One possible explanation of the dis- 
crepancy should be explored. In the 
well-known Benussi circle (Fig. 3), 
assimilation (i.e., uniformity of color 
in the ring) is achieved when the ring 
is unbroken by any sharp contour. 
It is possible that Heider’s conditions 
and ours may have differed in this 
respect. While it is true that the 
edge of the episcotister would ordi- 
narily provide such a contour and clear- 
ly did so in our conditions, there is one 
situation in which that might not be 
thecase. Ifthe brightness of Heider’s 
background (against which the yellow 
figures appeared) was identical to the 
brightness of the episcotister surface, 
then contour formation would have 
to be achieved by hue differences 
alone. There is no doubt that this 
would blur the contour to some extent. 
Whether that fact alone would suffice 
to account for the difference in our 
two results is a matter for investiga- 




















Fic. 3. The Benussi circle. When a heavy 
contour breaks the gray circle along the line 
that separates the two background colors, the 
simultaneous contrast effect is enhanced dramati- 
cally so that the pinkish tinge of the right-hand 
half of the ring and the greenish tinge of the left 
become very strong instead of barely noticeable. 
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Another possibility is that Os who 
are naive or Os who have not had a 
good deal of experience making psycho- 
physical judgments, might easily be 
persuaded that the colors under the 
two conditions (solid circle and broken 
circle) are different. It is not easy to 
judge the portion of the total distant 
field seen through the episcotister and 
judge it independently of that portion 
of the same yellow paper that is seen 
outside the field of the episcotister. 
When one is confronted with the un- 
contaminated yellow and the yellow- 
blue mixture simultaneously, it is 
very easy to “interpret” the yellow- 
blue mixture as being a yellow paper 
seen through a blue screen. Under 
these conditions, if the matching ex- 
periment is not carried out carefully, 
O may very easily get the impression 
that this judgmental reaction is a 
perceptual one. It is only when he 


compares the part of the yellow field 
that is seen through the episcotister 


directly to the Maxwell disc that it 
can be shown that what he sees is not 
different from what he sees in the case 
where the entire yellow field resides 
behind the episcotister. A naive O 
who is first shown the yellow ring seen 
entirely through the episcotister and 
who is then shown the yellow ring 
slid up until half of it appears beyond 
the edge of the episcotister is likely 
to react by saying “Oh.” This re- 
action is a matter of his sudden rec- 
ognition that the yellow ring is really 
a much more saturated yellow than he 
had realized before, although he had 
recognized it as yellow rather than 
gray from the beginning. Under the 
influence of this attitude, the O may 
change his interpretation of the gray 
portion of the total field and attend to 
it as a portion of the now yellower 
circle merely obscured by a blue mist 
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or film. We considered the possibil- 
ity, in a sense that is rather difficult 
to define or specify, that the bottom 
part of the circle might now somehow 
look yellower for some Os than it did 
before. This possibility would be 
even more likely if O had some ex- 
pectations or theoretical presupposi- 
tions about the conditions beforehand. 

The procedure of our first experi- 
ment might actually obscure such a 
result by requiring O to make a long 
series of somewhat artificial judg- 
ments. As a consequence, we set up 
a simpler test. In the second experi- 
ment, the same conditions of observa- 
tion were used, but a gray field con- 
taining two small (2-in. diameter) 
yellow discs was exposed. One of the 
discs was seen only through the epi- 
scotister; the other was partially ob- 
scured by the episcotister and par- 
tially exposed to direct view. The Os 
were asked to make a simultaneous 
comparison between the two yellows 
seen through the blue film. Six 
naive Os reported that they could see 
no difference. Three sophisticated 
Os gave exactly the same result. 


SUMMARY 


Using careful quantitative methods 
by which Os judged the apparent yel- 
lowness of a figure seen either through 
or surrounded by a blue of the com- 
plementary hue, we have been unable 
to find any influence of the separation 
of planes or of assimilation. Earlier 
authors had reported a distinct en- 
hancement of the contrast effect 
under conditions of transparency and 
of “strong figures.” It is suggested 
that their results were due to inade- 
quate control of illumination or sam- 
pling error. 


(Manuscript received October 21, 1950) 





206 W. C. H. PRENTICE, J. KRIMSKY, AND S. BARKER 


REFERENCES 3. Hemwer, Grace, M. New studies in trans- 
parency, form, and color. Psychol. 
Forsch., 1933, 17, 13-55. 

. Tupor-Hart, Beatrix. Studies in trans- 


1. Fucus, W. Experimentelle Untersuchungen 
uber die Anderung von Farben unter dem 
Einfluss von Gestalten (“Angleichungs- 

: b> parency, form, and colour. Psychol. 
a Forsch., 1923, 10, 255-273. 

2. Fucus, W. Untersuchungen iiber die psy- >. WAtLAcH, H., anv Gattoway, A. Con- 
chologischen Grundprobleme der Tiefen- stancy of colored objects in colored illu- 
wahrnehmung. Z. Psychol., 1923, 92, mination. J. exp. Psychol., 1946, 36, 
294-325. 119-126. 








THE ABILITY TO MAKE ABSOLUTE 
JUDGMENTS OF PITCH! 


BY ALICE VAN KREVELEN ? 


University of Rochester 


The ability known as absolute pitch 
has been a subject of interest for many 
years. Neu (6) recently reviewed the 
work of 58 authors of which, however, 
comparatively few used Ss already 
possessing the requisite absolute abil- 
ity. Petran (7) in a study resembling 
the second experiment in the present 
investigation did observe nine Ss with 
absolute pitch. His article reviews 
the literature up to 1932. Bachem 
(1, 2,3, 4) has made the most recent 
investigation of known possessors of 
absolute pitch. Nevertheless, a more 
precise, quantitative report of the per- 
formance of people possessing this 
ability seems necessary in view of the 
meagre data describing their accuracy 
in any finer units than semitones. 


PROBLEM 


This investigation was an attempt 
to determine how accurately judg- 
ments of pitch could be made by per- 
sons definitely possessing the ability 
known as pitch recognition or absolute 
pitch. An unusual degree of precision 
in frequency monitoring was utilized 
in an attempt to discover the limits of 
the ability. 


1 This paper is a portion of a dissertation sub- 
mitted to the Department of Psychology of the 
University of Rochester in partial fulfillment of 
the requirements for the Ph.D. degree. The 
writer is indebted to Professor Edward H. Kemp 
of the Department of Psychology and to Pro- 
fessor Wayne Barlow of the Eastman School of 
Music for advice and assistance throughout the 
investigation. The research was supported in 
part by funds from the American Otological 
Society. The Conn chromatic stroboscope 
used in this investigation is the property of the 
U. S. Navy, and was loaned for the experiment. 

? Now at Hollins College, Virginia. 


Two experiments were performed. 
In one experiment Ss were required to 
identify randomly presented oscillator 
tones which covered the range from 
404 to 478 cps in 2-cps steps. This 
will be referred to as the “recognition 
method.” In the second experiment 
the same Ss attempted to adjust oscil- 
lator frequency so as to produce G# 
(415.30 cps), A (440 cps) and 4% 
(466.16 cps). This is called the “‘pro- 
duction method.” 

Previous studies have shown that 
musical Ss judge best those tones lying 
in the middle range of the musical 
frequency scale. Since it was de- 
sired to get a measure of the very 
best performance that could be ex- 
pected, the three tones to be investi- 
gated were selected from this middle 
range. It is unfortunate that only a 
very small number of frequencies 
could be thoroughly studied within 
the amount of time which the Ss 
could reasonably be asked to give. 


SUBJECTS 


The Ss were chosen according to Bachem’s 
criterion for universal genuine absolute pitch (1). 
All students in the Eastman School of Music 
during the academic year 1948-49 who had given 
evidence of absolute pitch ability by identifying 
piano tones on an entrance test were asked to 
report for a second test. For this group test all 
Ss were required to identify all 88 tones on the 
piano. Each key was held down for approxi- 
mately 1 se:. and the time between notes was 
just sufficient for Ss to write down the proper 
tone if recognized immediately. 'The papers 
were collected as soon as the last note was given 
so that there was no time for Ss to check over 
their answers. The sequence of tones was ar- 
ranged to rule out the effects of relative pitch as 
far as possible: no two consecutive tones were 
taken from the same octave and the smallest 
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intervals were those usually considered most 
difficult to identify, such as major and minor 
ninths, augmented elevenths, fourteenths, etc. 
Twenty-three Ss were selected from the 40 who 
took the test. These were Ss who either identi- 
fied all 88 notes correctly or who made occasional 
half-tone errors in the extreme upper and lower 
octaves, thus conforming to Bachem’s criterion. 

This group was given a third test with pure 
tones produced by means of an oscillator and 
headphones. This was an individual test in 
which S gave his judgments aloud. These were 
recorded by E. Since the experimental tones 
were not to be taken from the extremes of the 
scale only the 48 tones lying in the middle four 
octaves were used. The sequence of tones was 
again arranged to rule out the effects of relative 
pitch as far as possible. 

The 17 Ss (10 women and 7 men) who identi- 
fied all 48 oscillator tones correctly were selected 
for the study. The age range of these Ss was 
from 17 to 37 years with the majority between 
the ages of 17 and 22. All Ss filled out a ques- 


tionnaire before the experimental sessions began. 
They gave information concerning their musical 
history, family musical background, etc. (Table 
I). The mean age of beginning formal music 
study was five years and all the Ss were engaged 
in the professional study of music with the inten- 
tion of pursuing it as a career. 


TABLE I 


DeEscripTION oF SUBJECTS 








Age 
Began 
Music 
Study 


Music 
Degree 
Sought 


Musical 
Parents* 


Major In- 
strument 
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Clarinet 
Voice 
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* Includes both amateur and professional. 
** Also had an aunt with claimed absolute pitch. 
*** Also had a brother with claimed absolute pitch. 
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APPARATUS AND PROCEDURE 


Experiment I.—The experiment was carried 
out in a small room having sound-deadening wall 
panels and protected from hall noises by two 
closed doors while the sessions were in progress. 
The room was in an unusually quiet location, 
and at no time were there any musical sounds 
audible during an experimental session. 

The S, wearing Permoflux PDR-8 earphones 
mounted in sponge rubber, sat at a small table 
shielded from the E and equipment. The tones 
used were produced by a Hewlitt-Packard 
Audio-Oscillator, Model 200D. Sensation 
threshold was determined for each S at 422 cps, 
which was midway in the range of frequencies to 
be used, and all tones were presented at an in- 
tensity 40 db above this level. The output of 
the oscillator was connected through an at- 
tenuator into the earphones while a Conn 
chromatic stroboscope was used to monitor the 
input to the attenuator (8). 

The E tuned the oscillator while watching the 
stroboscope pattern until the desired frequency 
was being produced. She then pressed a tele- 
graph key which led the tone to the earphones of 
S and held the key down for approximately 1 sec. 
No attempt was made to eliminate a click be- 
cause the duration seemed long enough to mini- 
mize the disturbing effect of transients. Ap- 
proximately 10 to 15 sec. elapsed between tones 
in the series. 

Tones were presented to S in random order 
and the same frequency appeared ten times in 
the series. The tones range from 404 cps, 
about a quarter tone below G# to 478 cps, a 
quarter tone above 4%, in 2-cps steps. Two 
exceptions had to be made, however, in order to 
include the correct frequency for both G# and 4, 
one being an odd number of cycles and the other 
aneven. The total series consisted of 390 tones. 
The order was prearranged according to a table 
of random numbers and was the same for all Ss. 
The S received 100 tones at each session. 

The following instructions were given at the 
first session and S was briefly reminded of 
them at the beginning of subsequent sessions: 
“You will hear a series of tones. Some are 
on pitch and some are not. We are inter- 
ested in finding out just what you will accept as 
sounding like a good C, C#, etc. For instance, 
suppose you hear a tone lying between D and 
Dé; if it sounds closer to D report it as a sharp 
D; if it sounds closer to D# report it as a flat Dz. 
In other words always report in terms of the 
closest musical tone. If you say just D, D#, etc., 
we will understand that the tone sounds exactly 
on pitch to you. Report your judgment aloud 
after each tone and I will write it down over 
here.” 
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The Ss were not aware of the limits of the 
frequency range to be investigated as it was not 
desired to give them any frame of reference other 
than a purely subjective one on which to base 
their judgments. 

Experiment I1.—The apparatus used was 
the same as in Experiment I. This time S sat 
at a small table next to £’s table but separated 
from it by a partition. Through an opening in 
the partition he could insert his hand and arm 
and turn the dial of the oscillator without being 
able to see the dial or the oscillator. The oscil- 
lator was monitored by the stroboscope as in 
Experiment I and the tones were presented at 
the same intensity. The tones requested were 
G#, A and Az. In the total series S “found” 
each tone 100 times, 50 times from original 
settings below the goal tone and 50 times from 
above. Five preliminary settings were chosen 
in 10-cps steps around a musical fifth above and 
below the tone being requested. Therefore, the 
original settings were never exactly on pitch for 
any tone. Each setting was used ten times in 
the series. The order in which the tones were 
requested and the original settings were ran- 
domized and the same random order was used 
for all Ss. 

These Ss were given the following instruc- 
tions: “This time you will be able to change the 
tone yourself by turning the oscillator dial. 
You can vary the frequency until it sounds like 
the tone I am asking for. One way to do this 
is to go to the tone you want and then make it 
either a little sharp or a little flat, depending on 
whether you are coming from above or below the 
tone, and then go back to where you hear the 
tone you want again. If you prefer to go slowly 
until you héar the tone you want and then stop, 
that is all right too. We want you to do it the 
way you get the best results. However, do not 
go as much as a half tone above or below the 
tone you are looking for when making final ad- 
justments. You can have a few trials to see 
which way to turn the oscillator dial to make 
the tone go up or down and how much you have 
to turn to make a certain amount of change. 
Then I will ask you to produce a note. As soon 
as you hear a tone in your ear start turning the 
dial to get what you want. All the notes I ask 
for are to be in the middle-C octave, the same 
octave as the tores you listened to in the other 
part of the expetiment. As soon as you have 
the tone you want remove your hand from the 
dial and I will know that you have finished 
tuning.” 

As soon as E had tuned to the desired pre- 
liminary frequency the tone was keyed into the 
earphones and the key was held down until S 
removed his hand from the dial to signal that 
he had finished tuning. The tone was then 
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keyed out of the earphones and £ turned the 
stroboscope dial until some stroboscopic fre- 
quency coincided with the frequency being pro- 
duced by the oscillator. The reading was re- 
corded in cents (.01 semitone) deviation from the 
goal tone, both magnitude and direction of error 
being noted. About 50 judgments were made 
at each session. 


RESULTS 


Exp. I: Recognition method.—The 
individual means and standard devi- 
ations for all Ss for the recognition 
method are presented in Table II. 
The means are the averages of all the 
frequencies that were judged by the Ss 
to sound exactly on pitch for each 
note. It should be mentioned that 
510 of the total of 6,630 tones heard 
by the 17 Ss were actually on pitch 
for G#, A or A#. A little better than 
one-third (2,313) of the judgments 
made were “on pitch” judgments; 
more than four times the number of 
tones that were on pitch were called 
“on pitch.” Furthermore, of the 
170 true G#s presented, 44 per cent 
were named correctly while the re- 
mainder were judged to be either sharp 
or flat; 47 per cent of the 170 4’s and 
52 per cent of the 170 4#’s were rec- 
ognized correctly. The SD, in cps, 
was calculated around the S’s own 
mean and is an index of the consist- 
ency of his judgments. The constant 
error may be seen by comparing the 
mean with the accepted physical 
standard for a particular tone, based 
on an A of 440 cps. 

The constant errors were larger for 
4% than for G# and A for the group as 
a whole. The Ss were also less con- 
sistent in their judgments for 4# than 
for 4 as measured by the SD’s. The 
mean SD’s for the group, obtained by 
summing the individual SD’s and 
dividing by the number of Ss, were 
7.57 cps for G#, 6.40 cps for 4 and 
7.11 cps for 4%. These SD’s are of 
the magnitude of about .34 semitone. 
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TABLE Il 


PERFORMANCE OF SuBjects Usinc THE RecoGcnition MetTHop 
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Reported as G# 


Reported as A Reported as A# 





Mean in cps SD in cps 


Mean in cps 


SD in cps Mean in cps SD in cps 





414.86 
413.40 
441.50 
417.60 
420.20 
416.96 
431.81 
426.50 
423.27 
419.30 
423.18 
421.05 
413.96 
416.12 
422.27 
424.36 
413.24 


4.56 
4.29 
14.20 
6.86 
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437.18 
434.36 
453.92 
438.09 
437.57 
433.47 
449.37 
445.47 
445.95 
440.53 
444.77 
441.41 
441.38 
436.64 
439.00 
441.61 
437.74 


5.39 
4.51 
12.23 
6.22 


463.57 





Mean of pooled 
judgments 421.08 


440.43 460.30 








Standard frequency 415.30 











440.00 466.16 











Figure 1 shows the pooled “on 


pitch” judgments for all Ss. It will 
be seen that the peaks for all tones 
occur at frequencies coinciding with 
the true standards but that there is 
also overlap between categories and 
the range for any one category is 
quite wide if we consider all the judg- 
ments that were made. There is 
even a small range from about 440 
cps to about 452 cps where all three 
categories were used in identifying the 
tones. 

The curves, as viewed roughly, 
seem to represent fairly normal dis- 
tributions of judgments around peaks 
coinciding with the true values of the 
particular notes. The curve for G# 
appears to be cut off at the lower end 
and that for 4% at the upper end. 
This is probably due to the experi- 
mental conditions since the frequency 
of 404 cps was the lowest that was 
given and that of 478 cps was the 


highest that was given. If the fre- 
quency range had been extended in 
either direction it is probable that a 
few Ss would still have occasionally 
used the categories of G# and 4# and 
these curves would correspond with 
that for 4. 

Means were calculated for each 
category of the pooled judgments of 
all 17 Ss. These means were found 
by averaging all the frequencies that 
were judged to sound exactly on 
pitch for each note. 

Exp. IIT: Production method.—Table 
III contains the individual means and 
SD’s for all Ss for the production 
method. The individual means are 
the averages of all the settings that 
were made by each S for a particular 
tone and the SD’s are calculated 
around these means. The constant 
errors can be calculated from the ac- 
cepted physical standard for each 
note, based on an A of 440 cps. 
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Fic. 1. Pooled distributions of actual frequencies of tones judged by 17 Ss 
to be G#, A, and 4% (Recognition Method) 


TABLE III 


PERFORMANCE OF Susjects Usinc THE Propuction Metuop 
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Selected asG¢ ? | Selected as A Selected as A# 
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Mean in cps SDincps | Mean in cps SD in cps Mean in cps SD in cps 








411.25 4.60 431.69 
411.96 4.79 435.20 
424.03 11.25 451.33 
412.44 6.11 439.24 
416.75 4.08 439.75 
411.72 4.24 431.44 
425.50 6.43 442.80 
412.44 4.48 437.21 
416.99 4.66 437.97 
417.95 4.54 437.72 
414.59 5.46 441.53 
412.44 5.47 435.45 
416.75 3.19 443.32 
418.43 3.34 437.21 
416.99 5.46 437.47 
412.67 5.05 442.55 
417.23 2.92 439.75 

Mean of pooled 
judgments 415.67 438.73 460.21 
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POOLED JUDGMENTS 


17 SUBJECTS ~ PRODUCTION METHOD 
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Fic. 2. Pooled distributions of actual frequencies of tones judged by 17 Ss 
to be G#, 4, and 4# (Production Method) 


Figure 2 presents graphically the 
pooled judgments of the group for the 
production method. Again, as in the 
recognition method, the judgments 
for each tone are distributed around a 
peak which coincides with the true 
physical standard for that particular 
category. 

Means were calculated by pooling 
the judgments of all 17 Ss. These 


values differ only slightly from the 
values obtained when the 17 individ- 
ual means were averaged. 
Comparison of methods.—It may be 
seen by referring to Tables II and III 
that the individual Ss seem to be more 
consistent in their judgments (they 
show smaller SD’s) if the production 
method is used rather than the rec- 
ognition method. The averaged SD’s 


TABLE IV 


DiFFERENCES BETWEEN Mean SD’s oF JUDGMENTS UNDER THE CONDITIONS 
oF RECOGNITION AND PropucTION 











Recognition Method 


Production Method | 
| 


Difference 





Mean SD 
in cps 


| SD 





GF 


A 
A 


7.57 
6.40 
7.11 


2.56 
2.15 
1.64 











* Significant at the .02 level of confidence (df = 16). 


** Significant beyond the .01 level of confidence (df = 16). 
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TABLE V 


DIFFERENCES BETWEEN MEAN AVERAGE Errors OF JUDGMENTS UNDER THE 
Conpitions oF RECOGNITION AND PropucTIon 








Recognition Method 


Production Method 





Mean in 
Semitones 


Mean in 
Semitones 


Mean in 


SD Semitones 





342 
.234 
.269 


.293 
290 
.325 


G# 
4 
At 











049 
— .056 
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084 
090 











* Significant at the .05 level of confidence (df = 16). 


for the group are also smaller for the 
production method. 

The t-test was applied to differences 
in performance under the two condi- 
tions as measured by the SD’s (Table 
IV). The formula used to find the 
standard error of the mean difference 
is that used for matched pairs (5). 
For G# at of 5.85 was obtained. This 
is significant at better than the .01 
level of confidence. The #-value for 
A was 2.87 and that for 4# was 1.75, 
the value for 4 being significant at 
better than the .05 level, and the 
value for 4# is not significant. So it 
can be said that for the tones G# and 
A the Ss were more consistent in their 
judgments when the production 
method was used. 

The t-test was also applied to dif- 
ferences in performance under the two 
conditions as measured by the average 
error taken from the true standard for 
each particular tone (Table V). The 
SD around the S’s own mean is an 
indication of the consistency of his 
judgments but the average error taken 
from the true standard is a reflection 
of the accuracy of his judgments. 
For G# the t was 1.11 indicating that 
there was no significant difference be- 
tween accuracy of performance in the 
two experiments. The ¢ for 4 was 
2.54 showing a difference significant 
at better than the .05 level of confi- 
dence in favor of the recognition 


method. Likewise, the ¢ for A4# 
(2.33) was significant at better than 
the .05 level and also in favor of the 
recognition method. We must con- 
clude that, while the Ss were more con- 
sistent in their judgments when they 
could control the frequency them- 
selves, they were more accurate in 
their judgments for 4 and 4# when 
they identified the heard tones. 


Discussion 


One of the objects of this investiga- 
tion was to discover how wide a band 
of frequencies would be accepted as 
sounding on pitch for a given tone. 
The results which answer this question 
are shown graphically in Fig. 1 and 
2, and have been discussed above. 
We conclude that no sharp judgmental 
limits for all Ss for any one tone were 
demonstrated by this experiment.* 

Means were calculated to give an 
idea of the central tendency in making 
judgments and for comparison with 
the true physical standard for each 
particular tone. Tables II and III 
give the means of the pooled judg- 
ments for all 17 Ss for both methods. 
The mean for G# was higher than the 
standard by ‘about 6 cps when the 


* For a more complete analysis and description 
of the performance of each individual in this 
experiment the reader is referred to a thesis on 
file at the University of Rochester under the 
same title as this article. 
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recognition method was used and 
deviated less than 1 cps from the 
standard when the production method 
was employed. For judgments of 4 
the mean values in both experiments 
were within approximately 1 cps of 
the true value. The means for 4# 
were approximately 6 cps lower than 
the standard under both the recogni- 
tion and production conditions. It 
should be noted, however, by referring 
to Fig. 1 and 2, that the modal value 
of the pooled judgments actually 
coincides with the true physical 
standards. When the 17 individual 
means were averaged for each method 
they were very similar to the means 
of the pooled judgments, which per- 
haps indicates that the group is 
fairly homogeneous. 

Consideration of the means alone 
tells us nothing of the variability. 
The consistency and accuracy with 
which judgments were made is best 
indicated by the SD’s and the average 
errors. 


The mean average errors, in 
general, for both methods and for all 
tones were of the approximate magni- 


tude of .3 semitone. Bachem (I, 2, 
4) has reported average errors of .1 
semitone for his Ss. Inthe present ex- 
periment no single S made an average 
error as small as .l1 semitone. One of 
the 17 Ss had average errors only 
slightly higher than .1 semitone for 
three of the tones, one with the pro- 
duction method and the other two 
with the recognition method. How- 
ever, the performance of Ss in the 
present experiment cannot readily be 
compared with that of Bachem’s Ss 
since it is not easy to deduce from 
Bachem’s publications exactly how 
he arrived at the figure of .1 semitone. 

The differences, pointed out earlier, 
between the two methods hardly seem 
reasonable on first consideration. 
Why should the “production” method 
give the more consistent judgments 
while the “recognition” method gives 
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the more accurate ones? One possi- 
ble explanation for this result may be 
as follows. Inthe production method 
the S was never given any frequencies 
that were exactly on pitch for pre- 
liminary settings but arrived at his 
idea of the goal tone only by tuning 
through a series of intermediate fre- 
quencies until he reached a point 
which satisfied his criterion. He may 
have been satisfied with a setting 
which was some distance away from 
the true standard on the first setting 
and the very consistency with which he 
turned to this same tone increased his 
average error. When SD’s are com- 
pared for the two methods they are 
significantly smaller for the produc- 
tion method for the tones G¥ and A. 
It would seem that S, being able to 
control the frequency himself, could 
establish his own criterion for a tone 
and turn to it quite consistently. 

It may be argued that only the first 
judgment made in any session is with- 
out reference to any other heard tone. 
In the production experiment even the 
first judgment would not satisfy this 
requirement since S tuned through 
several intermediate frequencies to 
reach his goal tone. But when S is 
given no outside cue as to the correct- 
ness of his judgments, the fact that he 
has just heard another “unknown” 
tone would seem to be of little help. 
Furthermore, even if it were desirable 
to use only the first judgment each 
day, the time required to obtain 
statistically reliable results would 
make such experimentation a practical 
impossibility. 

Petran (7) characterized his Ss with 
absolute pitch as having the ability 
to make the majority of their judg- 
ments using the production method, 
within G#% to 4% when they were try- 
ing to produce 4. The 17 Ss in the 
present experiment were apparently 
more accurate, since only occasional 
settings varied as much as a half-tone 
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from the goal tone and average errors 
were of the magnitude of .3 semitone. 
It is possible that if average errors 
were calculated for all settings made 
by Petran’s best Ss they would have 
been of the same order of magnitude. 
Petran does not describe hjs method of 
selecting Ss so there is a possibility 
that not all of his nine Ss would have 
satisfied Bachem’s criterion for genu- 
ine absolute pitch. 

Bachem mentions two factors as 
being of importance in the develop- 
ment of absolute pitch: (a) an atten- 
tion factor, and (b) an inheritance 
factor. In this group of Ss possessing 
genuine absolute pitch only two Ss 
reported any relatives with absolute 
pitch. Since the mean age of begin- 
ning formal music study was five years 
and since most Ss reported either one 
or both parents as being engaged in 
music either professionally or on an 
amateur basis, it is safe to assume 
that these Ss were hearing musical 
tones at a very early age. In these 
particular Ss, therefore, the factor of 
primary importance may have been 
attention to tones at an early age. 


SUMMARY AND CONCLUSIONS 


Seventeen Ss, selected according to 
Bachem’s criterion for genuine abso- 
lute pitch, performed in two experi- 


ments. In one they were required to 
identify randomly presented oscillator 
tones which covered the range between 
404 and 478 cps in 2-cps steps. Inthe 
second ‘experiment the same Ss at- 
tempted to adjust oscillator frequency 
so as to produce G# (415.30 cps), 4 
(440 cps), and 4% (466.16 cps). 

The Ss were more consistent in 
their judgments, as measured by SD’s, 
for two of the tones when the produc- 
tion method was used than when the 
recognition method was used. How- 
ever, performance was more accurate, 
as measured by the average error, for 
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two of the tones with the recognition 
method. 

The range of individual means 
shows that, as judged by the perform- 
ance of this group, there is no particu- 
lar frequency which is agreed upon by 
all individuals to correspond to a par- 
ticular tone; but when all judgments 
for all Ss are pooled the greatest num- 
ber of judgments coincide with the 
true physical standard for a given 
tone. This would seem to indicate 
that individual experience is import- 
ant in establishing the tonal norm for 
an individual and that the norm for 
these selected Ss usually falls rather 
near the true physical standard. 
Even with the established individual 
norms the Ss differed in the consist- 
ency with which they could retain 
their own standards through a long 
series of judgments. 

The group average error for both 
methods was of the approximate 
magnitude of .3 semitone. This is 
greater than Bachem’s estimate of .1 
semitone for his Ss. 


(Manuscript received October 15, 1950) 


REFERENCES 


. Bacnem, A. Various types of absolute 
pitch. J. acoust. Soc. Amer., 1937, 9, 
146-151. 

. Bacnem, A. Genesis of absolute pitch. /. 
acoust. Soc. Amer., 1939, 11, 434-439. 

. Bacnem, A. Chroma fixation at the ends of 
the musical frequency scale. /. acoust. 
Soc. Amer., 1948, 20, 704-705. 

. Bacnem, A. Note on Neu’s review of the 
literature on absolute pitch. Psychol. 
Bull., 1948, 45, 161-162. 

. Epwarps, A. L. Statistical analysis. 
York: Rinehart, 1946. 

. Nev, D. M. A critical review of the litera- 
ture on “absolute pitch.” Psychol. Bull., 
1947, 44, 249-266. 

. Petran, L. A. Experimental study of pitch 
recognition. Psychol. Monogr., 1932, 42, 
No. 6. 

. Younc, R. W., anp Loomis, A. Theory of 
the chromatic stroboscope. /. acoust. 
Soc. Amer., 1938, 10, 112-118. 


New 





JOURNAL OF EXPERIMENTAL PSYCHOLOGY 


The issues listed’ below are available for sale 


. Price per 
vailable Numbers Number 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 
$1.25 


By subscription, $14.00 (2 vols.) $1.25 


Volume 


> 


| BEBES I 
SSAsss 


Nw 


CAONOULWN— 
an 


NVWNNNYNNNNNNNNNNNHNNNHNNNNNNNNNNNN | NNN! | WN NH 
WOWWWWWWWWWWWwWWWWWWWwWwwWWwWwWWWwWwWwwWwWwwWw! | www i w 
LLLP PEA LALA LAL HL PEPE LPS EPL EPP PPE | S| | PPP PP 
AMAMAAAAAAAAMAAAAAAMMAAAAAAAAAaaaaaa!) Al Aaanan 
AADADAADADAADADADAAAAAADAGADAGDAGAAGDAGDAGQGA! | AQaani a 


1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 


VISSSSSSSSSSSSSSSSSIIISS SSS VSIARSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSESSSSS 


List price, Volumes 1 through 40 
30% discount 


Net price, Volumes 1 through 40 


Information about the Journal of Experimental Psychology: as noted in the table, the journal 
was not published in 1918 and 1919; from 1937 to 1943 it was published at the rate of two volumes 
per year; from 1944 to 1950, in one volume per year. Beginning in 1951 there are two volumes 
a year. 


Information about prices: when all numbers are available, the price per volume throughout 
the series is $7.00. For incomplete volumes, the price is $1.25 for each available number. 
For foreign postage, $.25 per volume should be added The American Psychological Association 
gives the following discounts on orders for any one journal: 

10% on orders of $ 50.00 and over 
20% on orders of $100.00 and over 
30% on orders of $150.00 and over 


Address orders and subscriptions to 


AMERICAN PSYCHOLOGICAL ASSOCIATION 
1515 Massachusetts Avenue N.W. Washington 5, D. C 





EARLY PUBLICATION 


IN 


APA JOURNALS 


The policy of accepting articles for immediate publication (providing the 
editor accepts the article and the author is willing to pay the entire cost of 
increasing the next available issue by enough pages to add his article to the 
normal content) is now standard practice for all APA journals except 
Psychological Abstracts and the American Psychologist. 


The actual charge made to the author includes three items : 
Mh: Aes eS ee This is the minimum amount that 
it costs to add an additional page to the journal. For 1951 these costs are: 

JouRNAL OF ABNORMAL AND SOCIAL PsyCHOLOGY 

JouRNAL oF CONSULTING PsyCHOLOGY 

JournNnat or COMPARATIVE AND PHYSIOLOGICAL PsyCHOLOGY. . 

Journat or ConsuLTiInG PsycHOLocy 

JourRNAL or EXPERIMENTAL PsyCHOLOGY 

PsyCHOLOGICAL BULLETIN 


PsycHOLOGICAL REVIEW 

* Since each Psychological Monograph is printed separately, the author of one handled on 
an early publication basis can be charged exactly the cost of printing. The figure of $15.00 is 
an approximate one; the actual figure will be higher for very short monographs and lower for 
very long ones. The cost will also vary depending upon the amount of special composition and 
the illustrations used. 


These charges are based upon several factors : 

(a) The greater number of words on a particular journal page, 
the higher the cost per page. Conversely, the fewer words 
printed on the page, the lower the cost per page. 

(b) The more copies which must be printed, the higher the cost. 

(c) The more expensive the printer, the higher the cost. Com- 
pared to the factors listed above, this is not an important 
difference in the charges made. 


2. The full cost of any cuts or other illustrative material, of special com- 
position for tables, and of author’s changes in proof. 


3. The full cost of any reprints ordered. (Authors of early-publication 
articles do not receive any free reprints.) 











Classics Among 
PSYCHOLOGICAL MONOGRAPHS 


Thorndike, E. L. The Mental Life of the Monkey. 1899, #15. 8.50 


Carr, Harvey. Visual Illusion of Movement During Eye Closure. 
1905, #31. $1.26 


Watson, John B. Kinaesthetic and Organic Sensations: Their 
Role in the Reactions of the White Rat tothe Maze. 1907, #33, 
$1.00 


Shepherd, William T. Some Mental Processes of the Rhesus 
Monkey. 1910, #52. $8.76 


Franz, Shepard Ivory and Lafora, Gonzalo R. On the Func- 
tions of the Cerebrum: The Occipital Lobes. 1911, #56. $1.26 


Fernberger, Samuel W. On the Relation of the Methods of Just 
Perceptible Differences and Constant Stimuli. 1912, #61. $1.00 


Boring, Edwin G. Learning in Dementia Praecox. 1913,#63. $1.00 


Langfeld, Herbert S. On the Psychophysiology of a Prolonged 
Fast. 1914, #71. $8.76 


Franz, Shepard Ivory. I. Symptomological Differences Asso- 
ciated with Similar Cerebral Lesions in the Insane. II. Varia- 
tions in Distribution of the Motor Centers. 1915, #81. $1.60 


Peckstein, Louis Augustus. Whole versus Part Methods in 
Motor Learning. 1917, #99. $8.75 


Kjerstad, Conrad L. The Form of the Learning Curves for 
Memory. 1919, #116. $1.35 


Tolman, Edward C. Retroactive Inhibition as Affected by Con- 
ditions of Learning. 1918, #107. $8.75 





MANY OF THE EARLY MONOGRAPHS ARE OUT-OF-PRINT. 
ONLY A LIMITED QUANTITY OF THE ABOVE 
NUMBERS ARE AVAILABLE. 





AMERICAN PSYCHOLOGICAL ASSOCIATION 
1515 Massachusetts Ave. N.W., Washington 5, D. C. 











